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Appendix  A.  Implementations  of  Algorithms  for  Computing 
the  Sample  Covariance  Matrix  on  a Vector 
Pipeline  Processor 

All  the  implementations  are  written  in  an  easily  understood, structured 
programming  language.  These  implementations  are  presented  for  operation 
count  purposes  only  and  are  not  intended  to  be  compilable  code.  For  a 
discussion  of  these  implementations,  please  see  Section  4. 3.2. 2 "Calculating 
the  Sample  Covariance  Matrix  on  a Vector  Pipeline  Processor." 

In  the  following  discussion,  sample  covariance  matrix  will  be  abbre- 
viated to  SCM  and  sample  vector  to  SV.  MR  and  MI  will  be  the  arrays 
containing,  respectively,  the  real  and  imaginary  parts  of  the  SCM  and  will 
be  floating-point  (FP).  XR  and  XI  are  FP  arrays  containing  the  real  and 
imaginary  parts  of  the  SV.  In  front  of  each  implementation  will  be  a list  of 
variables  with  their  types  (floating-point  (FP)  or  integer  (I)),  their 
lengths  (as  a function  of  the  number  of  weights  (N)  and  the  number  of 
samples  (NS)),  and  their  contents  and  storage  scheme  (contents  omitted 
for  MR,  MI,  XR,  XI;  storage  scheme  is  vectorwise,  componentwise,  etc.). 

Vectors  are  denoted  by  double  subscripts  like  MR(K,L),  which  means 
the  L-word-long  vector  starting  at  location  K of  array  MR.  Vector  oper- 
ations are  denoted  by 

1)  A(K1,L1)  = B(K2,L1)  ± C(K3,L1) 

2)  A(K1,L2)  = D(K2,L2) 

3)  X = SUM[A(K2,L3)] 

which  are  equivalent  to,  respectively, 

1)  FOR  L0C=0  to  Ll-1 

A(K1  + LOC)  = B(K2  + LOC)  t C(K3  + LOC) 

END  FOR 
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2)  FOR  LOG  = 0 to  L2-1 

A(K1  + LOG)  = D{K2  + LOG) 

END  FOR 

3)  X = 0 

FOR  LOG  = 0 to  L3-1 
X = X + A(K2  + LOG) 

END  FOR 

If  a non-vector  appears  to  the  right  of  the  equal  sign  in  case  1)  or  2), 
it  is  treated  as  though  it  were  a vector  of  the  appropriate  length  con- 
taining the  constant  value  of  the  non-vector. 

FOR-END  FOR  loops  behave  as  they  should:  if  the  initial  value  is 
greater  than  the  final  value  and  the  step-size  is  positive  (default  = 1), 
or  if  the  initial  value  is  less  than  the  final  value  and  the  step-size 
is  negative,  the  loop  is  skipped  and  control  passes  to  the  first  state- 
ment following  the  END  FOR. 
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Implementation  1 — Unpacked  Storage 


VARIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

N^ 

rowwi se 

MI 

FP 

n2 

rowwise 

XR 

FP 

N 

XI 

FP 

N 

ROW 

I 

1 

current  row  of  SCM 

START 

I 

1 

starting  location  of 

current  row  of  SCM 


BEGIN 
START  = 1 

C UPDATE  EACH  ROW  OF  THE  MATRIX 
FOR  ROW  = 1 TO  N 

C UPDATE  THE  REAL  PART 

MR(START,N)  = MR(START,N)  + XR(ROW)*XR(l .N)  + XI (ROW)*XI (1 ,N) 
C UPDATE  THE  IMAGINARY  PART 

MI(START.N)  = MI(START,N)  + XR(ROW)*XI (1 ,N ) - XI (ROW)*XR(l ,N) 
START  = START  + N 
END  FOR 
END 
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Implementation  1— Unpacked  Storage 


VARIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

N(N+l)/2 

rowwise 

MI 

FP 

N(N+l)/2 

rowwi se 

XR 

FP 

N 

XI 

FP 

N 

ROW 

I 

1 

current  row  of  SCM 

START 

I 

1 

starting  location  of  current 
row  of  SCM 

LENGTH 

I 

1 

length  of  current  row  of  SCM 

BEGIN 
START  = 1 


C UPDATE  THE  1 THROUGH  N-2"“  ROWS  OF  THE  MATRIX 
FOR  ROW  = 1 TO  N-2 

C UPDATE  THE  REAL  PART 

MR (START, LENGTH)  = MR(START, LENGTH)  + XR(ROW)* 

XR(ROW, LENGTH)  + XI (ROW)*XI (ROW, LENGTH) 

C UPDATE  THE  IMAGINARY  PART 

MI (START  + 1, LENGTH  - 1)  = MI (START  + 1 , LENGTH  - 1) 

+ XR(ROW)*XI(ROW  + 1, LENGTH  - 1) 

- XI(ROW)*XR(ROW  + 1, LENGTH  -1) 

START  = START  + LENGTH 
LENGTH  = LENGTH  - 1 
END  FOR 

C UPDATE  THE  REAL  PART  OF  THE  N-r'"  ROW 

MR(START,2)  = MR(START,2) 

+ XR(N-1)*XR(N-1,2)  + XI(N-1)*XI(N-1,2) 

C UPDATE  THE  IMAGINARY  PART  OF  THE  N-lst  ROW 

MI(START+1)  = MI(START+1)  + XR(N-1 )*XI (N)  - XI(N-1 )*XR(N) 
C UPDATE  THE  REAL  PART  OF  THE  Nth  ROW 

MR(START+2)  = MR(START+2)  + XR(N)*XR(N)  + XI(N)*XI,N) 

END 
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Implementation  2--Unpacked  Storage 


\RIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

n2 

rowwise 

MI 

FP 

N^ 

rowwise 

XR 

FP 

N 

XI 

FP 

N 

f 

TRl 

FP 

N^ 

temporary  arfay  containing 
a stretched  form  of  XR 

TR2 

FP 

N^ 

temporary  array  containing 
a stretched  form  of  XR 

Til 

FP 

N^ 

temporary  array  containing 
a stretched  form  of  XI 

TI2 

FP 

n2 

temporary  array  containing 
a stretched  form  of  XI 

ROW 

I 

1 

current  row  of  SCM  being 
stretched 

START 

I 

1 

starting  location  of  cur- 
rent row  of  SCM 

LENGTH 

I 

1 

length  of  whole  SCM 

BEGIN 

C STRETCH  OUT  SAMPLE  VECTORS 

START  = 1 
FOR  ROW  = 1 to  N 

C STRETCH  THE  REAL  PARTS 

TRl (START, N)  = XR(1,N) 

TR2(START,N)  = XR(ROW) 

C STRETCH  THE  IMAGINARY  PARTS 

TI1(START,N)  = XI(1,N) 

TI2(START,N)  = XI (ROW) 

START  = START  + N 
END  FOR 

C UPDATE  MATRIX 

LENGTH  = N*N 

MR(1, LENGTH)  = MR(1, LENGTH)  + TRl (1 .LENGTH )*TR2(1 .LENGTH) 
+TI1 (1  ,LENGTH)*TI2(1 .LENGTH) 

MI (1, LENGTH)  = MI (1, LENGTH)  + TRl ( 1. LENGTH)*  TI2(1 .LENGTH) 
- Til (1 ,LENGTH)*TR2(1 .LENGTH) 
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END 


Implementation  2— Packed  Storage 


VARIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

N(N+l)/2 

rowwise 

MI 

FP 

N(N+l)/2 

rowwise 

XR 

FP 

N 

XI 

FP 

N 

TRl 

FP 

N(N+l)/2 

temporary  array  con- 
taining a stretched 
form  of  XR 

TR2 

FP 

N(N+l)/2 

temporary  array  con- 
taining a stretched 
form  of  XR 

Til 

FP 

N(N+l)/2 

temporary  array  con- 
taining a stretched 
form  of  XI 

TI2 

FP 

N(N+l)/2 

temporary  array  con- 
taining a stretched 
form  of  XI 

ROW 

I 

1 

current  row  of  SCM 
being  stretched 

START 

I 

1 

starting  location  of 
current  row  of  SCM 

LENGTH 

I 

1 

length  of  current  row 

of  SCM,  then  length 
of  whole  SCM 


BEGIN 

C STRETCH  OUT  SAMPLE  VECTORS 
START  = 1 
LENGTH  = N 
FOR  ROW  = 1 to  N-1 
C STRETCH  OUT  REAL  PARTS 

TR1( START, LENGTH)  = XR(ROW, LENGTH) 

TR2 (START, LENGTH)  = XR(ROW) 

C STRETCH  OUT  IMAGINARY  PARTS 

Til (START, LENGTH)  = XI (ROW, LENGTH) 

TI2 (START, LENGTH)  = XI (ROW) 

START  = START  + LENGTH 
LENGTH  = LENGTH  - 1 
END  FOR 

TRl (START)  = XR(N) 

TR2 (START)  = XR(N) 

Til (START)  = XI (N) 

TI2(START)  = XI (N) 

r UPDATE  MATRIX 

LENGTH  = N*(N+l)/2 

MR(1, LENGTH)  = MR(1, LENGTH)  + TRl (1  ,LENGTH)* 

TR2(1, LENGTH)  + Til (1 ,LENGTH)*TI2(1 , LENGTH) 

MI  (1, LENGTH)  = MI (1, LENGTH)  + TRl (1 , LENGTH)* TI2(1  .LENGTH) 
- TI 1(1, LENGTH)*TR2(1, LENGTH) 

END 
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Implefnentation  3 — Unpacked  Storage 


VARIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

rowwise 

MI 

FP 

n2 

rowwise 

XR 

FP 

N*NS 

real  parts  of  NS  SV's  stored 
componentwise  (i.e.,  1st 
component  of  all  NS  samples, 
followed  by  2nd  component  of 
all  NS  samples,  etc. ) 

XI 

FP 

N*NS 

imaginary  part  of  NS  SV's 
stored  componentwise 

T 

FP 

2*NS 

temporary  array  to  be  used 
with  SUM 

ROW 

I 

I 

row  of  current  element  of  SCM 

COLUMN 

I 

1 

column  of  current  element  of  SCM 

STARTM 

I 

1 

location  of  current  element 
of  SCM 

STARTMT 

I 

1 

location  of  transpose  of  cur- 
rent element  of  SCM 

STARTXR 

I 

1 

starting  location  of  ROWth 
components  of  SV's 

STARTXC 

I 

1 

starting  locations  of  COLUMNth 
components  of  SV's 

NS2 

I 

1 

2*  nunber  of  SV's 

BEGIN 

COMPUTE  THE  INNER  PRODUCT  OF  THE 
EACH  ELEMENT  OF  THE  MATRIX 

NS2  = 2*NS 

STARTM  - 1 

STARTXR  = 1 

STARTXC  = 1 

FOR  ROW  ■=  1 to  N-1 

COMPUTE  DIAGONAL  ELEMENT 

DATA  FOR 

T(1,NS)  = XR(STARTXR,US)*XR(STARTXC,NS) 
T(NS+1.NS)  = XI{STARTXR,NS)*XI{STARTXC,NS) 
MR(STARTM)  = SUM(T(1 .NS2) ) 

STARTMT  = STARTM  + N 
STARTM  = STARTM  + 1 
C COMPUTE  OTHER  ELEMENTS 

FOR  COLUMN  = ROW  + 1 to  N 
C STARTXC  = STARTXC  + NS 

CALCULATE  REAL  PART 

T(1,NS)  = XR(STARTXR.NS)*XR(STARTXC,NS) 
T(NS+1,NS)  = XI(STARTXR,NS)*XI(STARTXC,NS) 
HR(STARTM)  = SUM(T(1.NS2)) 

MR (STARTMT)  = MR (STARTM) 

C WLCULATE  IMAGINARY  PART 

T(1,NS)  = XR(STARTXR,NS)*XI (STARTXC, NS) 

T(NS+1,NS)  = -XI (STARTXR, NS )*XR (STARTXC, NS) 
MI (STARTM)  • SUM(T(1.NS2)) 

MI (STARTMT)  = -MI (STARTM) 

STARTMT  = STARTMT  + N 
STARTM  = STARTM  + 1 
END  FOR 

STARTM  = STARTM  + ROW 
STARTXR  = STARTXR  + NS 
STARTXC  * STARTXR 
END  FOR 

C COMPUTE  LAST  DIAGONAL  ELEMENT 

T(1,NS)  = XR (STARTXR, NS )*XR (STARTXR, NS) 

T(NS+1,NS)  = XI(STARTXR,NS)*XI(STARTXR,NS) 

MR (STARTM)  » SUM(T(1 ,NS2)) 

END 
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Implementation  3— Packed  Storage 


VARIABLE 

TYPE 

LENGTH 

CONTENTS 

MR 

FP 

N(N+1)/2 

rowwise 

MI 

FP 

N(N+l)/2 

rowwise 

XR 

FP 

N*NS 

real  parts  of  NS  SV's  stored 
componentwise  (i.e.,  the  1st 
component  of  all  NS  SV's, 
followed  by  the  2nd  component 
of  all  SV's,  etc.) 

XI 

FP 

N*NS 

imaginary  part  of  NS  SV's 
stored  componentwise 

T 

FP 

NS 

temporary  array  used  with  SUM 

ROW 

I 

1 

row  of  current  element  of  SCM 

COLUMN 

I 

1 

column  of  current  element  of  SCM 

STARTM 

I 

1 

location  of  current  element  of 
SCM 

STARTXR 

I 

1 

starting  location  of  ROWth  com- 
ponents of  SV's 

STARTXC 

I 

T 

starting  location  of  COLUMNth 
components  of  SV's 

NS2 

I 

1 

2*  number  of  SV's 

BEGIN 

COMPUTE  THE  INNER  PRODUCT 
ELEMENT  OF  THE  MATRIX 

NS2  • 2*NS 

STARTM  = 1 

STARTXC  • 1 

STARTXR  • 1 

FOR  ROW  = 1 to  N-1 

OF  THE  DATA 

FOR  EACH 

C CWPUTE  DIAGONAL  ELEMENT 

TO.NS)  = xr(start;<r,ns)*xr(startxc.ns) 

T{NS+1.NS)  = XI(STARTXR,NS)*XI(STARTXC,NS) 
MR(STARTM)  = SUM(T(1 .NS2)) 

STARTM  = STARTM  + I 
C COMPUTE  OTHER  ELEMENTS 

FOR  COLUMN  = ROW  + 1 to  N 

STARTXC  = STARTXC  + NS 

C CALCULATE  REAL  PART 

T(1,NS)  = XR{STARTXR,NS)*XR(STARTXC.NS) 

T(NS+1,NS)  = XKSTARTXR, NS  )*XI  (STARTXC  .NS) 
MR(STARTM)  = SUM(T(1 ,NS2) ) 

C CALCULATE  IMAGINARY  PART 

T(1,NS)  = XR(STARTXR, NS )*XI (STARTXC, NS) 

T(NS+1,NS)  = -XI(STARTXR,NS)*XR(STARTXC,NS) 
MI (STARTM)  = SUM(T(1.NS2)) 

STARTM  = STARTM  + 1 
END  FOR 

STARTXR  » STARTXR  + NS 
STARTXC  = STARTXR 
END  FOR 

C COMPUTE  LAST  DIAGONAL  ELEMENT 

T(1.NS)  = XR(STARTXR.NS)*XR(STARTXR,NS) 

T(NS+1,NS)  = XI(STARTXl.NS)*XI(STARTXI,NS) 

MR(STARTM)  «=  SUM(T(1  ,NS2)) 

END 
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Appendix  B.  Implementations  of  Algorithms  for  Direct  Methods  of 
Solving  MW  = S on  Vector  Pipeline  Processors 

For  an  explanation  of  the  symbols  and  abbreviations  used  in  this  appendix, 
please  see  the  introduction  to  Appendix  A and  the  discussion  of  Section  4.3.2. 
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Gauss-Jordon  (GJ) 

Variables 

Type 

Length 

MR 

FP 

N*(N+K) 

MI 

FP 

N*(N+K) 

SR 

SI 

ROW 

I 

1 

DIVIO 

RP 

1 

SUBROW 

I 

1 

STARTR 

I 

1 

STARTS 

I 

1 

LENGTH 

I 

1 

DELTA 

I 

1 

Contents 

the  group  of  N+K  words  contains 
the  real  part  of  the  row  of  the 
SCM  followed  by  the  real  compo- 
nents of  K steering  vectors, 
imaginary  counterpart  of  MR 

contained  in  MR  and  MI 

current  row  of  SCM 

scale  factor  to  normalize  ROWth 

row  of  SCM 

row  from  which  multiple  of  current 
row  is  subtracted 

ROW+lst  location  of  ROWth  row  of  SCM 
ROW+lst  location  of  SUBROWth  row  of  SCM 
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ELIIlIl’ATE  nOV/TH  COVTIllI  CO  OIILY  rOV/TH  P.OV;  ILAS  1 'ill  IT, 

AtlD  THE  PEST  HAVE  EEROS 

FOR  ROVI  - .1  TO  n 

LEEGTII  - LEilGTH-l 
DIVIO  ■■  l./MIK  GTARTR-1) 

* RUT  A 1 in  THE  nOVJ-T!I  ROVJ 

I!P.(  CTARTR,LEi;OTU)  “ Iin(  STARTR .LEEGTII) ’"DIVIO 
HI(  CTAU'i'R,LEHGTH>  “ MI(  STARTE . LEEGTH)  "DIVID 
DTARTG  " ROV7H 

• ELIMinATE  FROII  THE  OVIIER  EOVIS 

. FOR  cnnr.oi;  - i to  n 

IF  Eunnov;  .he.  rov; 

i:u(  STA.RT0,LEr3TII)  - r!R(  STARTS  , LEEGTII) 

+ . . . - HR(  STARTS-.t)  ■••!"(  CTARTR.LTl'GTH) 

+ . . , + nx' CTARTS-.'I. )’"?!!(  GTARYR.LFRGTII) 

MI(  STARTS, LEEGTIl)  " MK  STARTS  ,LEEGT!I) 

+ , . . --  J:R(  STARTS-I )”!!!(  DTARTR^L  ’rST:!) 

+ . . . - IlK  STARTS-l)’''I!m  STARTR,LEl'.GT:n 

Elio  IF 

CTARTS  “ STARTS -iDELTA 
. KRO  FOR 

STARTP.  " STAnTR!-nR7,TA>I 
Elio  I'OR 

e:t) 

( I.'OTE  TEAT  IF  K"!  THE  VECTOR  Ol'EI’ATIOlTS  VriE’I  ROV?  "•  11  i;iLI, 

r.;:  ors  set  of  operatjos  loeg  aho  rmu.-.o  re  ciaeged  to 

SCALAR  OPERATIOLiS , THIS  iMI’LEHElVi'ATIOn  ASSURES  IC  > .1) 
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Gaussian  Elimination-GE-Decomposition 


Variable 

Type 

Length 

MR 

FP 

MI 

FP 

n2 

RECIP 

FP 

N 

ROW 

I 

1 

SUBROW 

I 

1 

STARTR 

I 

1 

STARTS 

I 

1 

LENGTH 

I 

1 

Contents 

rowwise  - unpacked 

rowwise  - unpacked 

reciprocals  of  components  of  D 

* 

in  decomposition  M = LDL 

current  row  of  SCM 

row  from  which  multiple  of 

ROWth  row  is  subtracted 

location  of  ROW+lst  word  of  ROWth  row 

location  of  ROW+lst  word  of  SUBROWth  row 

length  of  ROWth  row  starting  in 

column  ROW+1 
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EEGin 
LE'JGTll  ” H 
SV.ARTn  - 2 

C ELIMIHATn  nOV;TH  COAUICT  CO  IT  KAS  SEHOEC  EEI,OV7  A UNIT  DIAGONAL 


Fon  row  " .1  TO  n-2 

nncipcnovj)  - i./nw  starte-i) 

LEI'GTII  “ LEnGTH-l 

» IIAICE  UNIT  DIACOtTAL 

line  CTAr.TF.fliEIT'JT!!)  " r.n'0IP(  TOW)  ■'  !’r!  C'^AP.Tr.  .LEFOTII) 
I!I(  GTAnTi’wLCEGTil)  = nECIi’t  FX'W)  ' I’K  CTARTP , LEnoTIl) 

» DLiriltLATE  DELOW  THE  DIAGONAL 

CTARTC  " DTARTR 
FOR  ewnrew  " i;ov7  + i to  n 

CV’AP.VC  >'  C'^'ARTC  + N 

. MU' CTAr.TC.Ll.TOTII)  " MU'.  CTAUTC  .LrUTTH) 

+ . . cTAPTC  -.r)  MU'.  c''’A'>t;’,le::g?'i) 

+ . . i!i(  CTAivTC- 1. ) ■ nil  r/TAU';n , lengt:! > 

ni(  CTAnTC.LEMGTii)  - IU(  CTAU?:i  .LLMGTII) 

-!1U'.  GTAI’.Tfj  -l ) M;C(  G':''.\?.'iU,LSrGTU) 
-I!I(  G'TAUTG-l)  MuC  GTi'J’TU.LBI.OTII) 

JUii)  FOR 

GTARTi:  “ GTAF.TR  il  1 
END  Fpn 

Ei.ininATr;  n-xsT  coluixl) 


nnciin  n-.n 

i : TiuriU) 


1. . GTA.r.'in-.i ) 

ri;  oi"’"  " • ’ ) !’p'  GTAnTF.) 

F.;;-!.'..'-  I - MI(  GTAiiTU) 


CALCULATE  Rr.CIl’nOCAL  OF  I.AGT  DIAGOIAL  nLI’MENT 
G'TA.Il'/G  **  GTA.nTU  11 

ruj'ii'iu)  " .1 . /' nu' GTA.r.TG)  - .mui  gtau^u)  nui  gtauio 

■>  -I'nii  GTAUTU)  ni(  GTARTG  -.1)) 

EMO 
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★ 

Cholesky  without  square  roots-LDL  -Packed  decomposition  (the  only 
difference  between  the  packed  and  unpacked  version  is  the  calculation  of 
subscripts  and  pointers,  so  numbers  and  expressions  in  brackets  refer  to 
the  unpacked  version  and  replace  the  numbers  and  expressions  they  follow). 

Contents 


Variable 

Type 

Length 

MR 

FP 

N(N+l)/2  [N^] 

MI 

FP 

N(N+l)/2  [N^] 

TR 

FP 

N 

TI 

FP 

N 

RECIP 

FP 

N 

TRl 

FP 

1 

Til 

FP 

1 

ROW 

I 

1 

SUBROW 

I 

1 

STARTR 

I 

1 

STARTS 

I 

1 

STARTRS 

I 

1 

LAST 

I 

1 

LENGTH 

I 

1 

LENGTHS 

I 

1 

rowwise 
rowwi se 

temporary  array 

temporary  array 

reciprocals  of  elements  of  D 

* 

in  decomposition  M = LDL 

temporary  location 

temporary  location 

current  row  of  SCM 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

location  of  ROW+l^t  element  of 

ROWth  row 

location  of  SUBROWth  element  of 
SUBROWth  row 

location  of  SUBROWth  element  of 
ROWth  row 

location  of  last  element  of  SCM 
length  of  ROWth  row  starting 
at  element  ROW+1 
length  of  SUBROWth  row  starting 
at  element  SUBROW 
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DEGIU 

LAGV  - n»(nH)/2  [LAST  " II  • in 
SVAHTn  - 2 
LLUGXU  - IJ 

C CALCULATE  nOVJTII  r.0V7  OF  L*  FilCTOlI  III  M - LDL* 

Fon  now  " 1 TO  ii~2 


ILAICE  UllIT  DTACOI3A.L,  LAVE  nECIPr.CCAL 


EECIPtnOW)  - .1  ./!!?.(  CTAETn  - 1) 

I.EIIGTH  “ I.ErRTII  - 1 

vnc  now  .l,I,Ei;OTn)  - HECIPI  nOWl'-n-K  GTAPTn.LEITETH) 
TI(  nOWM.LEIlGTII)  “ HECIPC  r.OW)''f'.T(  ETARTll .LEEGTH) 


SUDTnACT  nULTIpLEa  OF  nOWTII  I'.CW  FEOIl  OTHER  130373 


LEtJGTir;  " LEIICTII 
ETAlViT.r,  •’  r.TA!3TU 

ET.'Vi'.T:;  r.TAI’.T13  •>  IJHTGTII  CSTAnTG  " GTARTU  > HI 
F013  G01',nOV7  - now  + 1.  'jO  17  - 2 

G TAFTS  , LEEGTH': ) " t'.PJ.  STAPTS  . T4EI1GT:I'3  1 

"T'H  s >ini’03  7)  '•!:?.!  STAFTPS  ,t.e;  IGTHG  ) 

-TK  r.ui'.now)  • S3i’AP.Ti;rj  wF’E  OTr.n ) 

IIK  GTAUTS  + .l,T,!JT.;GTl!S-.l>  " .W"' r.TAi’.TS  > ' P’:'0?:?S  - 1) 

-30j3(  r;tJr,!;ow)  fiTA''.Tr.:3  ,L’’::oT'n  - i) 
^•T'H  •'.•.inr.037)  GTAP.TRG  i-  I.,  LE'-TOTP-S  - .1) 
GTAj3TnS  •>  S3;'Ai:Ti:S  1, 

STAP.Y.'j  '■  S'.'Aj’.-.’r'j  ■lEIlGTIiS  CGTATITG  " GTA.PTS  * 17  .1.1 

lii.t.OTIIG  ''■*  IjLI.OTiiS  ■*  1 
EliD  FOi! 


GUETr..ACV  MULT.CPLE  OF  nOWTII  nO',7  ?.7-.1.: 


no:/ 


inu  ETAIIT.'; , 2 > - l?n{  JT.  'AnTG  , 2 > -'ilK  ll-.l ) .'JT.APTPG  , 2 > 

- T::<  ll-.l. ) f.]! ,;(  ET.AP.Tr.S  , 2 ) 

IlK  S'lAETS  1)  " 11.H  STAilTS  1 ) - TiK  1I--1  )•'!!£(  GTA13TP3  ■»•  1) 
TX(  ii-D't::;!  svAPvnii  i ) 
gtaht;  ■■  STAETiis  ^ i 


SUJiTn.A''”*.'  IITTLI 


;’LE  OF  EOWTH  IlOVI  FFO'!  lITri  iiCW 


line  7,7.ET ) " 1 '■:(  T A'l” ) -■  TIP  11 ) staftps  ) 

1’2(T!1-Hl2;  GVIillTFS) 


c 


riovE  t;  fitvaeeo  r‘'\!  V’)c:\  tp  mia  vt  to  ” ’ o;  o iii 


C CALCUI.ATil  ’i-rs'.'  P.r,-.7 

1 ^ '»•  1 „ 1 j 

1-1.1  . :/';\:i'';i) 

C 'Jl'.A'l'  I.'W.E  .1  ‘.'Ai..  j .. . 1 iPr.’_ 0. -. j 

ilil'llpdl)  ^ .1 

OK  Gl'A:;  1 •)  •• 

il  ' SMUlV.’l  " 


./d’  P'  AST)  - Vnl.'F’K  IF  AP  -’P)  - 

■■  .■  • ■..!)  ) 

'■'i;  I. 

TIG 


FE  11- 1. ' 
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Cholesky  with  square  roots-LL  - Packed  decomposition  (see  comments 

•k 

for  LDL  decomposition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

rowwise 

MI 

FP 

N(N+l)/2  [N^] 

rowwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  elements 

★ 

of  L in  decomposition  M=LL 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

location  of  ROW+lst  element  of 

ROWth  row 

STARTS 

I 

1 

location  of  SUBROWth  element  of 

SUBROWth  row 

STARTRS 

I 

1 

location  of  SUBROWth  element 

of  ROWTh  row 

LAST 

I 

1 

location  of  last  element  of  SCM 

LENGTH 

I 

1 

length  of  ROWth  row  starting  at 

column  ROW+1 

LENGTHS 

I 

1 

length  of  SUBROWth  row  starting 

in  column  SUBROW 


EECTtl 

LAGT  - U*(IJ  + l)/2  CLAST  - ll"!]] 
STAHTn  - 2 
LEKGVn  " M 


CALCULATE  ROVmi  ROW  OF  L*  FACTOR  111  tl  " LL* 

FOR  Rov;  “ 1 TO  - 2 

• CALCULA.TE  RECIPROCAL  DIAGOnAI,  AITO  RCVC 

RECXP(  ROV;)  " l./GORT  ( IIR(  STARTR-1)  ) 

LEKGT!!  “ LEEGTII  -1 

r.n(  GTARTR.LEEGTIl)  " RECTP(  ROV?)  «Mn(  flTAP.TR,LEIJG?:i) 

IlK  STARTR,LEI;GTII)  " REClin  RO\7)*MI(  STARTR,LHRGTH) 

• SUBTRACT  IIULTIPLEG  OF  P.OWTII  ROV/  FROM  OTHER  ROVJS 

LEnCTIlS  <■  LEHGTII 
'ARTRS  " STARTR 

STARTS  " START!:  + LEEGTH  CSTARTG  " STARTR  + H] 

FOR  sunnou  “ row  + i to  n-2 

MR!  STARTS  , I.EHGTIIS  ) " IlRt  STAR.TS  , DHJGTHS  ) 

-!iR(  STARTl’.S  ) ■•■nR(  STAR.TRS  , LTT.'GTHS  ) 

■•MK  f.TAi’TRS  ) •■7!T(  STAT'.TRS  .LEliGTHS ) 

IlK  STARTS  + .1  ,T,]:r.GTIiS  - I ) " I!. ;(  STARTS  I,  LEHGTHS  ■ 
-I'.R',  ST.ARTRS ) ''-flK  •■■AARTi’.S  + .1,  LE'.TCTHS  - 1 ) 
W.K  STARTRSl'-HRC  STARTRS  + A,  LERGTHS  -1) 
STARTRS  “ STARTRS  + .1 

ST-'.RTS  " STARTS  + LEHCTIIG  CGTARTS  ” STARTS  +11+13 
I.UEGTilS  “ AEEGTES  - A 
EllD  FOR 

SUBTRACT  I’.IJLTTPLE  O?  ROVJill  ROW  FRO.’!  W-AST  ROVI 

MR(  STARTS  , 2 ) ” I'R(  STARTS  ,2)  - F'R!  STARTRS ) " 

hr:  STARTRS  , 2 ) - J.'K  STARTRS ) ’■I'.K  STARTRS  , 2 ) 

IlKSTiARTS  + A>  ■’  IlK  STAR.TS  + 1)  - I i '(  STARTRS ) 

UK  STA.RTRS  + 1)  + MK  STARTilS ) ••■!R(  STARTRS  + 1) 
STARTR.:;  STARTRS  + 1 

SUBTR.ACT  no'LTEPT.E  O'?  rOViTII  ROW  FRCIl  HTH  ROW 


.nR(LAST) 

1;IIS  I'OR. 


!!R<  T AST)  - I'R(  ST.AF'.TRS  > '■■nR!  STARTRS  ) 

st.Art;’.  sta.::'.’;';;: 


STAR' 


y ;:  ;sT!i 


r;,'A\R-' 


.11 


c: 


!i-A 


r.ow 


REE.....'  (il-A!  ■'  A. /SORT  ( f'R;  ilTARTR  - 
i!r';;tartr)  ''  ( 1!-A)'Ee::  ss;  .;TR) 

ST.ARTR)  '■  R'"’.'EER  ( ! !- .A  ' ''.'"R  STA'".''.'.'’.) 


13  ) 


CAT.CULATA 
Vi:'.  ;.TR(  II) 
IWJ 


iIT;i  !)v;\C0-!AL  REEERROCAA 

' A . /R')'  -’  ' a.-.st3-i::r  i^ta!;'.;:)  ■"!  ■::(  saaa:er) 

saaa  aw’  i.lI  r.TAiA ) 


1) 
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Gaussian  Elimination  with  Vectorwise  Augmentation-GE- 


Variable 

Type 

Length 

Contents 

MR 

FP 

rowwise 

MI 

FP 

n2 

rowwise 

SR 

FP 

KN 

real  part  of  K steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  K steering  vectors 

vectorwise 

RECIP 

FP 

N 

reciprocals  of  components  of  D in 

•k 

decomposition  M=LDL 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

location  of  ROW+lst  word  of  ROWth  row 

STARTS 

I 

1 

location  of  ROW+lst  word  of  SUBROWth 

row  starting  at  column  ROW+1 

LENGTH 

I 

1 

length  of  ROWth  row 

AUG 

I 

1 

current  steering  vector 

STARTSV 

I 

1 

location  of  ROW+lst  element  of 

current  steering  vector 

STARTSVM 

I 

1 

location  of  first  element  of 

current  steering  vector 
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BEGin 

LEKGTH  - N 
STAHTIl  - 2 


C ELiniHATE  nOVITlI  COLUlUJ  SO  IT  IIAS  ZEBOES  DELOVJ  A UNIT  DIAGONAL 


FOR  now  " 1 TO  N - 2 

RECIP( ROVn  “ 1 . /MR(  STAUTR  - 1) 
LENGTH  - LENGTH  - 1 


C • MAKE  UNIT  DIAGONAL 

MR(  CTARTR, LEKGTH)  = RECIP!  ROVJ)  * I !?.(  STARTR .LENGTH) 
MK  STARTR, LENGTH)  " REG  I P(  NOV/)  * IlK  STARTR, LENGTH) 


C • ELIMINATE  BELOV)  THE  DIAGONAL 


STARTS  “ STARTR 
FOR  SUBROV)  ROW  + 1 TO  N 
STARTS  “ STARTS  + N 

MR' STARTS,  LENGTH)  = MP.(  STARTS  .LENGTH) 

+ . . -MR{  STARTS  - 1 ) * MRC STARTR, LENGTH) 

+ . . + ni(  STARTS  - .).)«  MK  STARTR, 7,ENGTH) 

HI(  STARTS, LENGTH)  = MK STARTS .LENGTH) 

+ . . -Mn(  STARTS-i)  MK  STARTR, LENGTH) 

+ . . - MI { STARTS  - 1)  * nn(  STARTR, LENGTH) 

. END  FOR 


C * ELIMINATE  FROM  STEERING  VECTORS 


+ 

+ 


+ 


END 


S TARTS V - ROW  + 1 
FOR  AUG  - I TO  K 

SR(  CTARTSV, LENGTH)  " SRI  STARTS’/, LENGTH) 

- SRC  STARTS V-l>  ■'  MR!  STyARV.’R .LENGTH) 
-SK  STA.RTS7-1)  " MK  ST;\RTR  , LENGTH) 

, SI(  GTAr.TSV,LE):GTH)  " S.K  STARTS  /, LENGTH) 
+f;R(  STARTSV-l)  >■=  MK  STARTR, LENGTH) 
-SK  STARTSV-I)  MR(  STARTR, LENGTH) 
STARTSV  - STARTS'/  + N 
END  FOP. 


STARTR  " STARTR  + N + 1 


FOR 


ELIMIN.ATE  11- 1ST  COLUMN 

RR"'.”!’!  V-I  > " .1 . /.’EE  STARTR- 1 ) 

r:i;!  STARTR)  ■’  isMAAl ',i-l )"''’  MR(  STARTR) 

.'!>:(  :;t.'.rtp.)  ■■  ir.gip  (n-i  ) mhstartr) 


cat.gulate  nEGirr.ocAy,  c,; 


diagonal  ri.ehent 


nr’"'. It 

nncn'di)  " VV/i’rR;;  starts)  - riirsTAnTR)  !'r(  starts-). ) 

!•  MiK  START”)  MK  STARTS-l ) ) 

EL“Mir’\TE  FRO!!  SVEERTi’.-,  VEGTC'!:')  AND  DIVIDE  DT  DIAGONAL  ELEMENTS 


STARTS'/  “ H 
S'’AR'-'SVM  ” I 

Tin  % -M  r\  I-  1 r'(') 

'.  snt  STAP.TSV)  - SR!  STARTS”)  SR' ST.ARTSV-l ) >'  !!?.(  STARTR) 
+ . -si(  s^’ARTsv-.; ) sTAiri!'!) 

SK  START. A ) . RK  STARTSV ) : SI!!  STARV’.IV- 1 ) ~ MK  STARTR) 


. SRC  S!;ARTSVM,.  I)  “ IS.GKRl., 

fi-  , cir'n  nnnt-T  • ll^  pn.-i  — « 

! starts;’  •’  SVAr'’SV  :■  U 

i-ri  M-.r  I ,,  <•  I- • i-.ir.^T'  f 'i 

END  FOR 
END 


.si.AR';';!) 

I)  ” SR!  STARTSV;!,))) 
‘*,r' ■' ^ *1) 
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Gaussian  Elimination  with  Componentwise  Augmentation 


GE 


Variable 

Type 

Length 

Contents 

MR 

FP 

N-(N  + K) 

the  i^"  group  of  N+K  words  contains 
the  real  part  of  the  i^^  row  of 

the  SCM  followed  by  the  real  parts 
of  the  i^^  components  of  the  K 
steering  vectors. 

MI 

SR 

SI 

FP 

N-(N  + K) 

analogous  to  MR,  except  contains 

imaginary  parts 

1 contained  in  MR  and  MI 

RECIP 

FP 

N 

reciprocals  of  elements  of  0 in 

decomposition  M = LDL* 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

location  of  ROW+1 st  word  of  ROWth 

row 

STARTS 

I 

1 

location  of  ROW+1 st  word  of  SUBROWth 

row 

LENGTH 

I 

1 

length  of  ROWth  row  starting  in 

column  ROW+1 
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r 


BEQIM 

LEWGTII  - II  ♦ K 
STAP.TH  - 2 


C BLIMII’ATn  nOWTH  COT^UfCJ  SO  IT  IIAO  SEPOES  EELOM  A TOUT  DIAGOHAL 

ron  novj  » i to  ii  - i 

i;ccip(  nov/)  - i./nncsTAnvp  - i) 

LEUGTII  “ LEKGTII  - 1 

MAKE  UniT  DIAGOKAE 

Mn(  ETAnTn.LEKGTII)  " nECIPt  POV;)  • Mn(  GTAP.Tn.tSKGTID 

Mi(  STAnTn,i,nMGTn)  - p.ecip(  novn  " ni(  STAnTn.LEEGTii) 

ELIMIITATB  BELCVI  THE  EIAGOMAL 

f.TAP.TS  - GTARTR 
FOn  SUBPOV/  ■>  I’.OV/  + 1 TO  N 
GTAPTS  " STARTS  + 17  + K 
. MRC  STARTii , T.-THGTri ) - IIP/  STARTS  , AEMGT!! ) 

. -IIP/ S'.'AIiVS-I)  " IIP/ GTARTR, LnilC-TH) 

•iiiK  r.TAi:TG-.i)  •’  iiK  GTARTR, i,e:;gth) 

MI(  GTAnTG,LE!yjTlI)  - MI(  S'AARTG  ,",RI..GTiI) 

. -Iin(  STARTS- .1 ) * I!I(  STA.RTR , 'jEIIGTH) 

-UK  DTARTC-1 ) " HR(  sta.p.tr,j,ei;ot:i) 

linn  FOR 

STARTP.  “ STARTR  + 17  + K + 1 

E17E  FOP. 

RECTPCIi)  J.  ./MR' SO'ARTR-l) 

MR!  GTARTR,!'.)  - RECTPI  17)  I!R(  GTARTR , lO 

UK  GTARTR, K)  - P.ECIIH  17)  '•  UK  GTARTR., !0 

C 

ni7D 

c 

(HOTS  THAT  IF  K " 1,  TIIR'I  Till:  FIITAJ,  ?.  VETTOP. 

OPEPATIOLIG  HTTP.  OIILV  ni:  CIi:-;  SET  OL’  OPERAirOS 
;,Oi:G,  i'.liR  S:iOIU,D  DE  OR'.l/J';')  TO  SCAIuAR  .jRjlll.ATIOII.j . ) 
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Cholesky  without  Square  Roots  with  Vectorwise  Augmentation-LDL  - 
(see  comments  for  LDL*  Decomposition) 


Variable 

Type 

Length 

MR 

FP 

N(N+l)/2  [N^] 

MI 

FP 

N(N+l)/2  [N^] 

SR 

FP 

K-N 

SI 

FP 

K-N 

TR 

FP 

N 

TI 

FP 

N 

RECIP 

FP 

N 

TRl 

FP 

1 

Til 

FP 

1 

ROW 

I 

1 

SUBROW 

I 

1 

STARTR 

I 

1 

STARTS 

I 

1 

STARTRS 

I 

1 

Contents 

rowwise 
rowwi se 

real  parts  of  K steering  vectors 
vectorwise 

imaginary  parts  of  K steering 
vectors  vectorwise 
temporary  array 
temporary  array 

reciprocals  of  elements  of  D in 

* 

decomposition  M = LDL 

temporary  location 

temporary  location 

current  row  of  SCM 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

location  of  ROW+lst  element  of 

ROWth  row 

location  of  SUBROWth  element  of 
SUBROW th  row 

location  of  SUBROWth  element  of 
ROWth  row 


Cholesky  without  Square  Roots  with  Vectorwise  Augmentation-LDL*  - “"rJ 


(see  comments 

for  LDL 

Decomposition)  (Cont'd)  ’ ^ 

Variable 

Type 

Length 

Contents 

LAST 

I 

1 

location  of  last  element  of  SCM 

LENGTH 

I 

1 

length  of  ROWth  row  starting  at 

element  ROW+1 

LENGTHS 

I 

1 

length  of  SUBROWth  row  starting 

at  element  SUBROW 

AUG 

I 

1 

current  steering  vector 

STARTSV 

I 

1 

location  of  ROW+1 st  element  of 

current  steering  vector 

STARTSVM 

I 

1 

location  of  first  element  of 

current  steering  vector 
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c 


c 


c 


c 


c 


c 


BEGIII 

LAST  - H*<H+l)/2  CLAST  - N*N] 

STAHTn  - 2 
LE13GTH  - M 

CALCULATE  ROWTII  ROW  OP  L*  FACTOR  IN  H - LDL* 
FOR  ROV/  - 1 TO  H-2 


HAKE  UNIT  DIAGONAL,  GAVE  RECIPROCAL 

RECIPC  ROW)  “ 1 . /MR(  GTARTR-I ) 

LENGTH  " LENGTH- 1 

TR(  ROW+1 .LENGTH)  - RECIPf  ROV/) •MR<  STARTR, LENGTH) 
TI(  r.OW+ 1 .LENGTH)  - RECIP(  ROV))  •MI(  GTARTP., LENGTH) 

SUBTRACT  MULTIPLES  OF  ROVmi  ROV/  PROM  OTHER  ROWS 


+ 

+ 

♦ 

+ 


LENGTHS  " LENGTH 
STARTP-S  - STARTR 

STARTS  “ STARTR  + LENGTH  CSTARTS  ” STARTR  + NJ 
FOR  SUBROV/  " ROV/  + 1 TO  N-2 
. MR(  STARTS , LENGTHS ) - MRf  STARTS , LENGTHS ) 

-TR(  SUBROV/)  "MR(  STARTRS  , T.ENGTIIS ) 

-TI(  SUBROV/)  "MK  STARTRS , J.ENGTHS ) 

MKSTARTS  + l .LENGTHS-).)  “ /!:i(  STARTS  + I, LENGTHS  - 1) 
-TR(  SUnnOW) "UK STARTRS  + 1, LENGTHS  - 1) 

+ TI(  SUBROW) ’-HRC  STARTRS  ♦ 1 .LENGTHS  - 1) 

STARTRS  “ STARTRS  + 1 

start;;  - starts  + lengths  [starts  » starts  > h ii 

I.ENGTIIS  " LENGTHS  - 1 
Et/D  FOR 


SUBTRACT  MULTIPLE  OP  ROV/TII  ROV/  PROIl  N-IGX  ROV/ 


MRC  STARTS  , 2 ) - MR(  STARTS  , 2 ) -TR(  IJ- 1 ) »'I4R(  STARTRS  , 2 ) 

+ . -TI(  U-.l ) *tll(  STARTRS , 2 ) 

MI(  STARTS  + 1)  “ MI(  STARTS  + 1)  -TR(  N-1 ) '•riK  STARTRS  ♦ I) 
+ . +TI(N-.1)'-MR(  STARTRS  + 1) 

STARTRS  ■>  STARTR:;  + 1 


SUBTRACT  irJLTIPI.E  OP  ROV/TII  ROV/  PROIl  NTH  ROV/ 


MRILAST)  '■  nR'L.AST)  - TR(  N )’’■/’?.(  ST.ARTRS) 

-TI(  11 ) '■MX(  SlARTRS  ) 

• MOVE  NORiALTRED  ROW  PROM  TR  AN.D  TI  TO  HR  AND  MI 
MR!  STARTR,  JJlilGTH)  " TRI  ROV/  + .V  , .LFIIGITI) 

iiK  STARTR, )M:’.:;;T;i)  “ now  :•  .1.  .ltjnoxII) 

STARTR  “ STARTR  + LENGTH  1 [STARTR  " STARTR  >•  N + 1) 

« SUBTRACT  .’'.’ILTIPT.n  OP  CyRRENT  ROW  PRO.*!  ST-ERRINO  VECTORS 


STARTSV  “ ROW  •!  ?. 

FOR  AUG  - ).  VO  K 

. SRI  START;; /.jMJNS'..:;;)  " sr;  start;;’.', length) 

. -SRI  ;;yAi’.vsv  -.;.  > ”tr(  ruwh  .j.ehgth) 

:;a\:!xia  -v)«tt<  row;). ,xiR';GTn> 

S.a  STAR-SV.LRRGT.')  " STI  ST.VATS  / , " Ei:  1 OA) 

•^sr:  staptsv-).)*;’-'  ro'/;-).  .LruT?:!) 
. -;;t(  ;'.VARTSV-.1) I r.ov;+i  .length) 

ST.Ar.'''S'.'  " GT.ARTSV  + N 
END  FOR 


END  FOR 
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C CAIXULATE  H-IST  HOV/ 

nEciP(n-.i)  - i./tin( fjTAr.Tn-i) 

TRi  “ nncip  ( sTAn-i’R) 

Tn2  “ nEciP(  H-l  > *n:L(  staptr) 

C CALCULATE  IITII  DIA.gOEAL  EECimOCAL 

HEciPdj)  " 1 . /( np(  i.AST)  - Tni  ” im(STAnTR) 

+ -TI.l-'HK  OTAPTn)  ) 

iincsTAUTn)  “ vni 
MKLTAnTn)  “ Til 
ETAn’i'EV  '■  n 
STAnTEVIl  “ 1 
FOn  AUG  " ,l  TO  !C 

r>n(  DTAETrA')  ■■  nn(rjTARTGV)  - sn(  otap-tsv-d  " tpi 

•+.  -r.K  ETAP.Tr.V-.l ) * Til 

f.K  GTAr.Tr.V)  = fil'ETAHTEV)  r.n(  ETAnTGV  - 1)  * Til 
+ . -GI(  UVAI’TOV-I  ) »'  Tni 

r.E(  ETiV.’.TEVI!,n)  " nECIP(  l,n)*EP.(  GTAPTGVn.n) 

. SI(  STAnTSVI!,!!)  » EECIP(  1 ,11) ’-UK  UTAnTtjVH.II) 

ETAnTSV  “ GTAP.TEV  + tl 
ETAP.TDVM  " STAUTSVn  + H 
nno  FOR 
EITO 
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Cholesky  without  Square  Roots  with  Componentwise  Augmentation  - LDL-  ' 

* 

(see  comments  for  LDL  Decomposition) 

Variable  Type  Length  Contents 

MR  FP  [N^+KN]  dividing  the  array  into  consecutive 

groups  of  length  N+K,  N+K-1,  N+K-2, 
....  1+K  [N+K],  the  i^^  of  these 
groups  contains  the  N+l-i  [N] 
real  parts  of  the  i^^  row  of  the 
SCM,  followed  by  the  real  parts 
of  the  i^"  components  of  the 
K steering  vectors 


MI 

FP 

+I(N  [N^+KN] 

analogous  to  MR,  except  it  contains 

the  imaginary  parts 

SR 

1 

i 

1 

> contained  in  MR  and  MI 

SI 

) 

1 

TR 

FP 

N+K 

temporary  array 

TI 

FP 

N+K 

temporary  array 

RECIP 

FP 

N 

reciprocals  of  elements  of  D in 

★ 

decomposition  M = LDL 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

location  of  ROW+lst  element  of 

ROWth  row 
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★ 

Cholesky  without  Square  Roots  with  Componentwise  Augmentation  - LDL 

★ 

(see  comments  for  LDL  Decomposition)  (Cont'd) 


Variable 

Type 

Length 

STARTS 

I 

1 

STARTRS 

I 

1 

LAST 

I 

1 

Contents 

location  of  SUBROWth  element  of 
SUBROWth  row 

location  of  SUBROWth  element  of 
ROWth  row 

location  of  last  element  of  SCM 
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DEGIIl 

LAST  ” !C"(N-1)  + tl"(r7+l)/2  CLAST  - K»<H-1)  + H"N] 
STAHTR  - 2 
LLUGTH  - n+K 

C CALCULATE  ROWTII  ROV;  OP  L^PACTOR  ItJ  11  " LDL» 

FOR  r.OW  - 1 TO  17-1 


c * riAKE  uniT  oiAcnrAL,  cave  rccipnocAL 

RnCIP(noW)  » 1 ./rip.(  STARTR-l) 

LEKOTH  “ Lni.'GTH  -1 

TiK  r.ou+1  .LEraiTi)  " nncipc  Rovnmnc  GTAP.TR.LEKGTn) 
Ti(  nov;>i,LEi;cTU)  " p.ecipc  nov7)*nT{  s'i’AnTR,7.Ei30T:i) 

C • SUDTRACT  MULTIPLES  OF  ROIV-TH  ROM  PROM  OTHER  ROWS 


J.EEGTIIS  " LEllGTII 
SVARTRS  - STARTR 

START!:  = STARTil  !•  LE17GTII  [STARTS  “ STARTR  + M ♦ Kl 
FOR  SIIEROW  - ROW  + .1  TO  17 

. MR(  STARTS,  LERGTnS)  " MR!  STARTS  , LEt’GTi'S  J 

-yn(  cur, ROW)  '-mr!  startrs  , tairotiis  > 

“T:c(  saiiRov;)  'viK  startrs  .lerotiis  ) 
ni(  STARTS  + .l  .I.lRT/TWS-l)  ■'  MI(  STARTSM  , LErsTriS-l ) 

-TRC  STI!’,r.CW)  '■  r’l!  STARTRS 1.  ,L!A:GTnS-.l  ) 

+ TI(  SURROW)  '-MiR  STARTRS  + l .TjERSTRS-l ) 

STAIJTRS  “ STARTRS  + 1 

STARTS  - STA.RTS  + LERCTIIS  rSTARTS  " STARTS  + 11  + K + 11 
:f.);i;GTiis  - lergtiis  -i 

EIIS  FOR 


C * MOVE  rORMALIRE!!  ROW  FROM  TR  AllS  TT  TO  MR  A!1D  MI 

MIK  STAJ’TR.Lni.'GTR)  ” TF.(  RCWs-l  .T.nilOTH) 

MI(  STARTR,  J.EnGTII)  " TK  ROW M , J.rMGT)!) 

STilRTl!  “ STARTR  + LEROTU  :•  1 C STARTR  " STARTR  + 17  + K + 11 
Elis  I'OR 


C CAI-OULATE  llTIl  OIAGOMAL  RECIPROCAL 

nnCIP(  71)  " 1 . /MR(  EAST) 

MR!  J AST  •>  7..7C)  ■■  RIT  IIP!  ID 'SliV  LAST  + 1,!!) 
IWdAST  + l,iC)  “ R;:CIP(  IM’-'II!  LAST  1.,!C) 
EIIA 
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1 


Cholesky  with  Square  Roots  with  Vectorwise  Augmentation  - LL*"  d 


(see  comments 

under 

★ 

LDL  Decompos 

ition)  

Variable 

Type 

Length 

Contents 

MR 

FP 

N{N+l)/2  [N^l 

rowwise 

HI 

FP 

N(N+l)/2  [N^] 

rowwise 

SR 

FP 

K-N 

real  parts  of  K steerina  vectors 

vectorwise 

SI 

FP 

K-n 

imaginary  parts  of  K steering 

vectors  vectcrvn'se 

RECIP 

FP 

H 

reciprocals  of  diagonal  elements  of 

★ 

L in  decomposition  M = LL 

ROW 

I 

1 

current  row  of  SCM 

SUBROM 

I 

I 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

) 

location  of  ROW+lst  element  of 

ROWth  row 

STARTS 

I 

1 

location  of  SUBROWth  element  of 

SUBROWth  row 

STARTRS 

I 

1 

location  of  SUBROWth  element  of 

ROWth  row 

LAST 

I 

1 

length  of  ROWth  row  starting  at 

column  ROW+1 

LENGTHS 

I 

1 

length  of  SUBROl.ih  row  starting 

in  column  SUBROW 

AUG 

I 

1 

current  steering  vector 

STARTSV 

I 

1 

location  of  ROW+lst  element  of 

current  steering  vector 


Bccin 

LAGT  - N*(n+l)/2  [LAST  - N»N3 
STAHTR  - 2 
LBHGTH  - H 

C CALCULATE  ROm’H  ROV;  OF  L*  FACTOR  IH  M - LL* 

FOR  ROW  - 1 TO  H- 2 


C • CALCULATE  RECIPROCAL  DIAGOHAL  AIID  ROW 

. RnCIP(  ROW)  - X./SQRT(MR(BTARTR-1)) 

LEEGTII  “ LENGTH- 1 

Hn(  CTARTR, LENGTH)  • REC I P(  ROW) *Mn(  OTARTH, LENGTH) 
UK  STARTR, LENGTH)  “ RECIP(  ROW)  *I!I(  STARTR, LENGTH) 

C • CUDTRACT  MULTIPLES  OP  ROVTTH  ROV)  PROM  OTHER  ROWS 


LENGTHS  • LENGTH 
. STARTRG  ■ STARTR 

STARTS  " STARTR  + LENGTH  [STARTS  « STARTR  + HI 
FOR  SUBROV;  - ROV;  + I TO  N-2 

HR( STARTS , LENGTHS ) - Mn(  STARTS , LENGTHS ) 

♦ . . -MR(  STARTRS ) >"[•«’.(  STARTRS , LENGTHS ) 

♦ . . -MI(  STARTRS ) "MK  STARTRS , LENGTHS ) 

. . MK BTARTG+I.LfiNGTHS-l)  - MK STARTS+l .LENGTHS-l) 

+ . . -Mr.(  STARTRS ) <^MK  STARTRS  + .l  .LENGTHS-l ) 

+ . . +MK  STARTRS ) ’■•URt  GTARTRO  + l .LENGTHS-l) 

STARTRS  - STARTRS +1 

STARTS  - r.TARTS+LEHGTUS  [STARTS  “ STARTS  +11  + 13 
LEI.’GTHS  - LENGTH-! 

. END  FOR 


C • SUBTRACT  MULTIPLE  OF  ROOTII  ROV?  PROM  H-13T  ROW 

. Mn(  STARTS . 2 ) - Mn(  STARTS . 2 ) - MR(  STARTRS ) " 

+ . MR(  STARTRS  , 2 ) - MK  STARTRS  ) "M  U STAP.TR3 . 2 > 

MK  STARTS  + l)  “ I’K  STARTS  + l)  - (’R(  STARTRS  ) * 

+ . MK STARTRS  + 1)  + MK STARTRS ) «MR(  GTARTR3  + 1 ) 

. STARTRS  " STARTRS +1 


SUBTRACT  MULTIPLE  OP  ROlv’TH  ROW  PROM  NTH  ROV? 

I!R(LAGT)  " MR'.LAOT)  - MR' STARTRS  ) "MR(  STARTRS ) 

-MK  STARTRS  ) '■MK  STA'iTRS  ) 

START?.  STARTR  + LENGTH  + J.  [STARTR  " ‘ST-AATR  + N + 13 

SUDTR.ACT  MULTIPLES  OP  CURRENT  ROV?  FRONi  GT.EEATNG  VECTORS 

STARTS';  " ROV?  + 1 
OR  AUG  “ 1 TO  K 

SR!  CTARTSV-1 ) " REOIP!  ROV?)  '••SR!  STARTS'/-  I ) 
s::(  STARTS V-1)  “ ROW)  STARTSV-1) 

SR!  STARTSV, LENGTH)  >=  SR!  STARTS  ; . LETVST'l ) 

-SR!  STL.;'.Sv-l)''i;;;i  S"AnT:!, LENGTH) 

-SK  r,TARTSV-l)' }'  ■'  SVA’VR.LE'.'GT;!) 

SI!  STARTSV .TjE'VGTjT)  ■■  SK  f.’.T;'.iV'’f!'W LENT'..’'!) 

+ SH!  S'i'ARTSV- 1 ) ''.'I  U S'i’ARTP. . LENGTH ) 

-ST!  fJTARTSV-). ) ' IIRI  Si'AnTR.. LENGTH) 

SV.ARTSV  ■"  STAilVSV  + II 
ENR  I’Ui’ 

END  FOR 
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CATiCULnTr:  n-ir; 


ncv7 


nij';.'::’:  ii-.i. ) t . /!;r'n7(  r'n(  jJTAr.i'P.-.i > ) 
r;v.\r.'.-.R)  " rii  c.cp!  11-7. ) r:'?;\!!Tn) 

hi;  » Hj-jt::';  n-’  > -iw.  -jvAr.vn) 

c:\vji)'LAT3  iVi'ii  piago!;a3  n:;Gii>R0CA7j 


nnc^inn)  - i./rGnT(i'p.(“,A!;v'  htargr) 'Tint  r^TARTR) 

GTAH”;’.)  ‘■■'ilTC  ''.GA.n’.’I’. ) ) 

SVAIITGV  “ U 

ro.i  r.uo  ■>  .1  TO  ic 


r’lV- '' ) 

o:-  ' r,’ 


:;iM  fiTAr.TOV-  i ) 

!jTA!'.V;3V-!l ) 

/)  ■’  ( OP'  "j't 

- r.7.(  i/.TU;T;;V-l.  )’•?!-( 
GKCTAHTOV)  “ C 0 7'  OTAnTOV)  •.•■ 
I> i'Aj'.VOV'~.L ) ’-IIIK 
GTAnTOV-r/TAnTEiV  + II 

;ri:o  roii 

El  1.0 


:•;( : 


vp;  n-i.) 

...'(  0-1.) 

'■V'..':to7-i  ! -::p(  3 

:■!)  )-;;EGi;p(  ni 


OG.V;;)  )-ilEGTn(  11) 


T.ARTR ) 
T-ARTR) 
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Cholesky  with  Square  Roots  with  Componentwise  Augmentation  - LL 


* 


(see  comments  under  LL 


* 


Decomposition ) 


Variable  Type  Length 

MR  FP  + KN  [N^+KN] 


MI  FP  MtU  +kn  [N^+KN] 


SR 

SI 

RECIP  FP  N 

ROW  I 1 

SUBROW  I 1 

STARTR  I 1 

STARTS  I 1 

STARTRS  I 1 


Contents 

dividing  the  array  into  consecutive 
groups  of  length  N+K,  N+K-1,  N+K-2, 
1+K  [N  the  i^^  group 
contains  the  N+l-i  [N]  real  parts 
of  the  i^^  row  of  the  SCM,  followed 
by  the  real  parts  of  the  i^^  com- 
ponents of  the  K steering  vectors 
analogous  to  MR,  except  it  contains 
the  imaginary  parts 
I contained  in  MR  and  MI 

reciprocals  of  diagonal  elements 

★ 

of  L in  decomposition  M = LL 

current  row  of  SCM 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

location  of  ROW+lst  element  of 

ROWth  row 

location  SUBROWth  element  of 
SUBROWth  row 

location  of  SUBROWth  element 
of  ROWth  row 
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ir  C 

Cholesky  with  Square  Roots  with  Componentwise  Augmentation  - LL  - 
(see  comments  under  LL  Decomposition)  (Cont'd) 


Variable 

Type 

Length 

Contents 

LAST 

I 

1 

location  of  last  element  of  SCM 

LENGTH 

I 

1 

length  of  ROWth  row  starting  at 

column  ROW+1 

LENGTHS 

I 

1 

length  of  SUBROWth  row  starting 

in  column  SUBROW 

B-28 


EEGIIl 

LAGT  « + H*(l]+l)/2  CLAST  " + iinji 

STAETn  - 2 
LEtJGTIl  “ II  ♦ K 

C CALCULATE  I>0V7-.Tn  nOVI  OP  L*  FACTOR  III  H - LL" 

FOR  nOVI  - 1 TO  N - 1 


C * CALCULATE  RECIPROCAL  DIAGOUAL  AtIO  ROW 


RECIPl  nOV7)  - .1 . /SORT!  IIRI  GTARTR-l ) ) 

I.EECTII  - LErCTH  - 1 

nm  STAP.tr, LEECTn)  “ RECIP?  ROVD'^IUK  STARTR.LErGTII) 
I!I(  STARTR.LEIIGTII)  “ RL'CIPC  ROW) ‘'nK  START;!, LEICGTH) 


C « SUBTRACT  MULTIPLES  OF  ROU-TH  ROM  FROM  OTHER  ROVIS 


LEITGTliS  " LEIIGTH 
GTARTRS  " STARTR 

STARTS  " STARTR  + LEITGTH  [STARTS  " STARTR  + 11  + K] 

FOR  SUBROV;  - ROM  + 1 TO  11 

MR(  STARTS,  LEMGTnS)  " MU(  STARTS  , LEIIGTIIS  ) 

♦ . . -MR(  GTARTRS  ) '■MP.l  GTARTRS  , Lir.lGTHS ) 

+ . . -!!!<  STARTRS ) '•MK  STARTPG  , LEITCTRS ) 

. . Mil  GTARTG  + l,j,LTlGT!IS-.l  ) MK  GTA)'.TS+ 1 .LErCTIIS-l ) 

+ . . -1:R(  GTARTRS  ) ’•'!;■:(  GTARTRS  + .l.  ,LE'.rjT:iS-.l ) 

+ . . +;!!(  GTARTRS ) ’•'MRl  STARTP.S  + .l  ,LBI.'GTH3-1 ) 

STARTRS  - GTARTRS  + 1 

GTiVRTS  “ STARTS  + LEIIGTIIS  [STARTS  " STARTS  + 11  + 1 
LL'IIGTIiS  " LEIIGTIIS  - 1 
nilD  FOR 

STARTR  “ STARTR  + LEKGTII  + 1 CGTARTR  ■■  STARTR  + 11  + K 
EllD  FOR 


C CALCULATE  Il-Til  DIAGOUAL  EI.nMEIlT 


nnCXPdl)  1 . /GOP.T(  .f’R(  LAST)  ) 

MR'  I.ACT  >.l.  ,K)  RECIIM  11)  '-MR;  LAGT  + .l  ,K) 
in':  LAGT+.l.  ,11)  “ RECIP(  niniK  LAST+,l,K) 
LTID 


t + 1] 

+ .[] 
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Gaussian  Elimination  with  Columnwise  Identity  Matrix  Augmentation-GE- 


Variable 

Type 

Length 

Contents 

MR 

FP 

n2 

rowwise 

MI 

FP 

N^ 

rowwise 

SR 

FP 

real  part  of  identity  matrix 

columnwise  (i.e.,  not  interleaved 

with  rows  of  SCM) 

SI 

FP 

n2 

imaginary  part  of  identity  matrix 

columnwise  (i.e.,  not  interleaved 

with  rows  of  SCM) 

RECIP 

FP 

N 

reciprocals  of  components  of  D 

* 

in  decomposition  M = LDL 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth  row 

is  subtracted 

STARTR 

I 

1 

location  of  ROW+lst  word  of  ROWth  row 

STARTS 

I 

1 

location  of  ROW+lst  word  of  SUBROWth  row 

LENGTH 

I 

1 

length  of  ROWth  row  starting  at  column 

ROW+1 

AUG 

I 

1 

current  column  of  identity  matrix 

STARTSV 

I 

1 

location  of  ROW+lst  element  of  current 

column  of  identity  matrix 

STARTSVM 

I 

1 

location  of  ROWth  element  of  current 

column  of  identity  matrix 

BEGIN 
LENGTH  “ n 
STARTR  - 2 


G ELIMINATE  P.OV/TK  COLUMN  SO  IT  HAG  EEnOES  EELON  A UNIT  DIAGONAL 

Fon  p.ov;  " 1 TO  n-2 

nEciP( novn  - i./mp! sTAPTn-i) 

LENGTH  ” LENGTH- I 

C « MAKE  UNIT  DIAGONAL 

Mn(  GTAPTR, LENGTH)  - PECIPt  POV?) ''MP?  STAnTP, LENGTH) 

MI ( GTARTP, LENGTH)  “ PECIP(  POV?) 'MK  STAP-TU, LENGTH) 

C • ELIMINATE  DELOV/  THE  DIAGONAL 

STAPTn-STARTP 
FOP  suDPov;  - pov;+.i  to  n 
STAPTS  “ STAP.TS+II 

Mn(  STARTS  .LENGTH)  ■"  Mn(  STARTS  .LENGTH) 

-M.R(  GTAr.TG-.l)*-!'R(  STARTR, LENGTH) 

-il’K  GTAPTS-D'-nK  STARTR. LENGTH) 

MI( STARTS. LENGTH)  " MI(  STARTS .LENGTH) 

-MRC STARTS-1)"MT(  STARTR. LENGTH) 

-MI  ( STARTS  - 1 ) >'MR(  STARTR . LENGTH ) 

END  FOR 

C " ELIMINATE  PROM  IDENTITY  MATRIX 


STARTSV  “ ROVJ+1 
FOR  ATJG  = 1 TO  ROV/-.). 

SiH  STARTSV.LEI.'GTH)  " SRC  STARTSV. LENGTH) 

-;;i:(  startsv-.1)  ■)!?.(  start?.. length) 

. “SK  ST-ARTSV-.l ) ■'MK  STAnTR. LENGTH) 

SI(  STARTSV. LENGTH)  " SI(  STAR.TSV. LENGTH) 

. +SR(  STARTSV-’ )-i’.E(  STARTR. LE\;GrH) 

-SI(  STARTSV-.’. ) '■■r  ;P.<  STARTR  .LENGTH) 


STARTS'; 
END  FOR 


TARTSV+N 


f:R(  STARTSV. LENGTTi)  ■ 
SI(  STARTSV. LENGTil)  • 
STAP.TR  “ STARTR.MI'!-.’. 
END  FOR 


- MR' ST.Ar;TR , LENGTH ) 
UK  S'A'.F.TR . t.IHTGTH ) 


Ei<iMimvr:  ii-ict  coi.umu 

HECii’c  ii-.i ) " 1 . /nn(  :j'7Ar.i’n-i ) 
line  GVAnTn ) - nircii'c  i.i -i) 'r’.n(  sTAnm) 
Mi(  svAn'iij)  » nee li';  n-.\ ) *i!X(  ci^iiTn) 


c?.r.c’JLATi]  nEcirrioc.i.',  o?  laet  oiAcomi,  eee!1ei7T 
GTAn?D  " DTAHTn  + !1 

ncciPdi)  “ 1 ./( in?;  G p-VG)  - nrn  sTAnTn)--nn(  g^arts-i) 

+ +ni(  GI’Ai'.'^i?  ) 'GiK  CTiinVG-l ) ) 


LLinillATE  I'nOM  innUTIEY  tlATHIX  AIID  DIVIDE  DY  DIAGOIIAT.  EEEMEnTS 


LErOYII  " II 
CYARYGV  " II 
CTARYGVI!  = I 
FOR  AUG  " .1  TO  II- 2 

Gn(  ETARTD7)  " fini  GTAnTGV)  - GDI  GTARTOV- 1 ) 

+ . ••rUYGTART!??  - GK  CYAHraV-I ) "'IK  GTARTR) 

GKCTARTCV)  = UK  nVARTUV)  + GW  GTARTriV- 1. ) 

+ . ’•M  i'.'  G.V'AR'i'l?)  - GI(  GTAI?'..:.'i7-.1.  ) GTARTR) 

Gi?(  sTAr.TGvrM.Ei-iGYn)  " nn':i)>(  Aui-i,j-.Ei:GT’.n ’UjRi  a'i’;'.RYGv;T,7.iiiiGTH) 

GK  GTA!iTGV!i!-;i. '•  n'rjy.Pi  AUG-M.EniiGTR-i)-' 

♦ . GK  G'.'AiriGVm-l  .LEIIGTII-I  ) 

G'i’Ar.TGVn  “ GVARTGVM  H -i-  .1 
GVAR'i’GV  » GVARTGV  * II 
;GV’I  '•*  IiDMG’i’Il  - 1 

niiD  !.’on 

GDI  G'iV.nTGV)  = G!?(  GTARYGV)  - CP.I  GTAr.TGV-l  > ’•MRI  GTARTR) 

■i-  ••  nT(G'.;;u:'./Gv-i  I’-nKUTAETi?) 

GK  GVIMtTGV)  =■  GK  GViP’TGV)  GlM  G TAidGV-I ) -MK  G;i;ART:?) 

+ - GK  GTAr.'.'GV-l ) ’-11’??  G'’A!;T;’.) 

GR(  GTAnTr;7ri,2 > ”■  rE':v3i!  ii-i.  .?.)■  ;-i;f  gta:?7G7M,2) 

G"<  G.'JARv.'GV; i-i-i. ) •’  I’:';-';:.".”:  id’^gk  g'oartg'':'!-!-!) 

■■  g'...'.''.;;'-.’gv:!  : ij-  i-  i 
Gi:;  G'i'ARVGVii)  >■  r.!-;r.;an  II) 

Eli.O 
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Gaussian  Elimination  with  Rowwise  Identity  Matrix  Augmentation  - GE  - 


Variable 

Type 

Length 

Contents 

MR 

FP 

2N^ 

i^*^  group  of  2N  words  contain  real 
part  of  i^  row  of  SCM  followed  by 
real  part  of  i^^  row  of  identity  matrix 

MI 

FP 

2N^ 

analogous  to  MR,  but  contains  imaginary 

points 

SR 

i 

SI 

1 contained  in  MR  and  MI 

RECIP 

FP 

N 

reciprocals  of  elements  of  D in 

★ 

decomposition  M = LOL 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth  row 

is  subtracted 

STARTR 

I 

1 

location  of  ROW+lst  word  of  ROWth  row 

STARTS 

I 

1 

location  of  ROW+lst  word  of  SUBROWth  row 
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Dnoiii 

lliWri'n  “ 2 


c KT.imi’ATn  novjTii  coLuini  do  it  hag  nnpoi.'D  nELoy  nx 


pon  no\7  " i TO  iT-i 

nEciP(  now)  “ i./r;n(  DTiinTn-i) 

C * I’J\rCE  UniT  DIAGOimi. 

p.r.(  nTPir.TP.,!!)  = nTciPt  rrivn’'i’.'!(  dt.tp.tp.,?]) 

IPiK  SVAnV!Ml-l)  “ ni;CXP(  I’OV,’) '-MK  DTi\nTIJ,N-l) 
C ■'  ELinin.-XTE  BELOW  THE  mp.GOnAL 


SVAHTr.  " DTAl’.TH 

ron  p.Tjnpov;  - now+.i  to  ii 

GTAnTr;  >'  cpap-td  + ii  + n 
. . lint  STAHTG,!!)  ■’  f.TAi’.TD  ,11) 

+ . . -nnt  GT.AnVH-l)v.iw  DTAnTjl.ll) 

+ . . •WIKHTAnTD-lW^ilKETAnTn,!!) 

1!T(  STAP.TfJ,!!)  " IPCt  HTAnTD  ,11) 

+ . . -nnt  DTAnTD-.i)  ■ MT(  ;iTAnTn,u) 

+ . . -I  n ( D TAHTD  - 1 ) ■■nnt  ! j TAlITn , 11 ) 

E'lO  FOR 

ETAnTH  " ETAHTR  + 11  f 11  + 1 
Bin  I'on 

HEJXPtil)  ■■  .1. . /IlP.t  ETAnTR-l) 

iin;  DTPRTi:,!!)  " p.ec.vim  rn'-nnt  nTAF.TR.n) 

!1J.(  ETAR'iTl,lI-.l. ) = HECaIH  I1)-M:C<  IlVAl’.Ti!,!:!-). ) 

Lll.O 


A'l'OrAL 
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Cholesky  without  Square  Roots  with  Columnwise  Identity  Matrix 
Augmentation  - LDL*''[ 

(see  comments  under  LDL*  Decomposition) 


Variable 

Type 

Length 

MR 

FP 

N(N+1)/2  [N^] 

MI 

FP 

N(N+l)/2  [N^] 

SR 

FP 

n2 

SI 

FP 

n2 

TR 

FP 

N 

TI 

FP 

N 

RECIP 

FP 

N 

TRl 

FP 

1 

Til 

FP 

1 

ROW 

I 

1 

SUB ROW 

I 

1 

START  R 

I 

1 

STARTS 

I 

1 

Contents 
rowwi se 
rowwise 

real  part  of  identity  matrix 
columnwise  (i.e.,  not  interleaved 
with  rows  of  SCM) 
imaginary  part  of  identity  matrix 
columnwise  (i.e.,  not  interleaved 
with  rows  of  SCM) 
temporary  array 
temporary  array 

reciprocals  of  elements  of  Din 

decomposition  M = LDL* 

temporary  location 

temporary  location 

current  row  of  SCM 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

location  of  ROW+1 st  element  of 

ROWth  row 

location  of  SUBROWth  element 
of  SUBROWth  row 
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Cholesky  without  Square  Roots  with  Columnwise  Identity  Matrix  , 

Augmentation  (Cont'd)  -LDL*- 


Variable 

STARTRS 

LAST 

LENGTH 

LENGTHS 

AUG 

STARTSV 


STARTS VM 


Type 

I 

I 

I 


Length 

1 

1 

1 


Contents 

location  of  SUBROWth  elements 
of  ROWth  row 

location  of  last  element  of  SCM 
length  of  ROWth  row  starting  at 
element  ROW+1 

length  of  SUBROWth  row  starting 

at  element  SUBROW 

current  column  of  identity  matrix- 

location  of  ROW+lst  element 

of  current  column  of  identity 

matrix 

location  of  ROWth  element  of 
current  column  of  identity  matrix 


DEGIN 

LAST  - N"(H*l)/2  CLAST  “ H*N} 
STAHTR  « 2 
LCnOTIl  - N 


CALCULATE  nOWTII  HOW  OF  L*  FACTOH  IH  M - LOL* 

FOR  ROV»  - 1 TO  N-2 

• MARE  UniT  DIAGORAL,  SAVE  RECIPROCAL 

. RECIP(  Rov;)  - I./nm  BTARTR-l) 

. LEIJGTn  " T.nnGTII-1 

Tn(  ROW+l.LEKGTII)  ••  RECIPC  ROV?) niRt  STARTR.LEIJGTn) 
. TI(  ROV;+ 1, LENGTH)  “ RECIPC  ROV?)  *MI(  8TARTR, LENGTH) 

* SUBTRACT  MULTIPLES  OP  ROVmi  ROV;  PROM  OTHER  ROVra 


LENGTHS  - LENGTH 
G7ARTRS  - STARTR 

STARTS  - STARTR  ♦ LENGTH  CBTARTS  - BTAHTU+N] 

FOR  BUBROX/  - ROH>.l  TO  H-2 

MR(  STARTS  , LENGTHS ) ••  MRt  STARTS , LENGTHS  ) 

-TR(  BUBROU) "MRC  STARTRS .LENGTHS ) 

-TT(  BUBROX))  "MK  BTARTRS  .LENGTHS) 

Ml(  STARTS  + l.E.-NGTHB-l)  - MK  BTARTS+l  .LEKGTHS-l ) 
-Tin  SUBROXJ) ' nT.X  BTART3  + 1.LENGT:I3-1) 
+T;C(  BUBROW)mR(  STARTS  + I. LENGTHS-!) 
STARTRB  » BTARTRO+l 

GTARTS  » STARTS  + LENGTHS  CDTARTB  - BTARTS+N+ll 
I.ENGTHS  - LENGTHS  -1 
END  FOR 


+ . 


SUBTRACT  MULTIPLE  OP  ROV.'TH  ROV)  PROM  N-IST  ROXf 

MR(  STARTS . 2 ) " f!R(  STARTS  . 2 ) -TR(  N-l ) ''IIRI  BTARTRS . 2 ) 
-TI<  H-,1. ) niK  BTARTRS  . 2 ) 

MIC  STARTS+l ) - ni{  STARTS  + l ) -TR(  N-l ) ’•’MK  STARTRS  + l) 
+T.T(  H-!  ) "IIRI  STARTRS  + l ) 

BTARTRS  - STARTRS+l 

SUBTRACT  NULTIPI.E  OP  RONTH  ROV)  PROM  NTH  ROV) 


I!R(  LAST ) 


MR(  LAST ) -TRC  N)  ’'MIU  STARTRS ) 
-TI<  II)  ’■■UK  STARTRS ) 


MOVE  NOrmLIEED  RON  FROM  TR  AllB  TI  TO  MR  ANU  MI 

I:R( STARTR , LENOTH ) “ TR(  RCN+ 1 , LENGTH) 

I IK  STARTR . T ENGTIi ) '■  TK  r.C'  i- 1 . J.TRTGTH ) 

STARTR  ” GTAIiTR-M.rNGTII+l  rSTARTR  " STARTR  + N + 13 

ELIMINATE  PROM  IDENTITY  MATRIX 

STARTS'/  " nOWH 
FOR  AUG  " 1 TO  ROVI-l 

SR(  GTARTDV,LI’i:-GT;l)  ■■  SR' STARTSV, LENGTH) 

+ . . - Si:(  BTARTS’;--!  TR'  ROU+l  .THR-TTH) 

+ . . -SK  GTAnTrN/-.l. ' "TK  ROVI+l  .LENGTH) 

• I' *'(  STr R'^'EV . ^1*^' ' '*  ^ T ’’T) 

. +S!’.(  S'^ARTSV-l ) ’H.’K  ROU+1  .LENGTH) 

. -SK  GVARTSV-.l ) "TiU  ROV/!-!  .LENGTH) 

SYARTfiV  ~ STAKTSV  + 11 
END  l Oil 

SI!(  CTAl’TSV. LENGTH)  » - I!R(  STARTR , IHINGTII) 

SK  S'.V/'rTCV.LElXiTu)  " I’K  STARTR .Ta'.N'ITin 
END  lOR 
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CALCULATE  U-IGT  nOV7 


nncip(  «-i ) “ 1 . /i!n(  rtaetr-i ) 

TRl  - RECVp;  n-.1.)>  nn(  ETAUTR) 

Tn2  - RECIP(  ETARTR) 

CALCULATE  IITII  DIAGOUAL  RECIPnOCAL 

PECIPdT)  " l./{I’n(IAC?)  -Tni^nnc  ETARTH) 

+ -TIl’^IlK  ETARTR)  ) 

HR'.  GTARTni  “ TT.l 
UK  2TARTR)  “ Til 

ELiniEATE  FROIl  IDEHT-ITY  inTHi::  AIID  DIVIDE  RY  DIAGOl-'AL  ELEMEtlTS 

LEEGTH  - N 
STARTUV  - II 
STARTSVIl  1 
FOR  AUG  “ \ TO  ri-2 

. SRISTARTEV)  “ SR' ETARTEV)  -ERI  STARTSV-l ) '■TRl 
+ . -si(  dtar':ev-1)''tti 

EKSTARTSV)  " EK  EVARTEV)  + ER(  GTARTEV-l ) 'Til 
+ . -EK  ETARTEV-.T  > ‘ Tn’. 

SRI  ETARTEVI1,LCI:GVI;>  » RECIR' AUG.Lni.-^TrD'-E-.'.I  ETARTEVIl.LEiTGTII) 
EK  ETARTEVI!-H,LL’liGTn-.l)  “ RECTPI  AUG  v 1. , LE:;GT:I-1  ) * 

♦ . S K ETARVEVn >■  1 , LEfOTU- 1 ) 

ETARTEVri  “ ETARTEVl!  ♦ II  + 1 
ETARTiiV  = ETARTE7  ► 1 
LEROTn  ” LEIIGTri  -1 
Uin  FOR 

SRI  ETAREV)  = ERI  ETARTEV)  -ER(  ETARTEV- 1. ) ' FRl-EK  ETARFEV-l ) •’'Til 
EKETARTEV)  " EKETARVEV)  + El.’f  ETARTEV-1  > "Tl  1 
+ -EK  ETAF.TEV-l ) ■'TRl 

ERI  EVARTEV?’..  2 ) = RTdIRI  ■!  -1 , 2 ) ’■•'ERI  ETARTEV': , ) 

EK  ETARTEv I ) " R'J'Kl'I  ID  '-GK  ETARTEVIDl ) 

GTARTEVIl  " E'iARTGVn  ■>  U + 1 
ERIETARTEV'D  ■■  RRCIPIID 
L’lID 
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Cholesky  without  Square  Roots  with  Rowwise  Identity  Matrix  Augnertation 
- LDL*-  1=1 

(see  comments  under  LDL*  Decomposition) 

Variable  Type  Length  Contents 

MR  FP  ^ [2N^]  dividing  the  array  into  con- 

secutive groups  of  length  N+K, 
N+K-1,  N+K-2,...,1+K[N+K],  the 
i^*^  of  these  groups  contains 
the  N+l-i[N]  real  parts  of 
the  i^*^  row  of  the  SCM,  fol- 
lowed by  the  real  part  of  the 


i^^  row  of  the  identity  matrix 


MI 

FP 

+^2  [2n2] 

analogous  to  MR,  except  it 

contains  the  imaginary  parts 

SR 

SI 

1 contained  in  MR  and  MI 

TR 

FP 

N 

temporary  array 

TI 

FP 

N 

temperary  array 

RECIP 

FP 

N 

reciprocals  of  elements  of  D in 

decomposition  M = LDL* 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

START R 

I 

1 

location  of  ROW+lst  element  of 

ROWth  row 

STARTS 

I 

1 

location  of  SUBROWth  element 

of  SUBROWth  row 


Cholesky  without  Square  Roots  with  Rowwise  Identity  Matrix  Augmentation  (Cont'd) 


★ 

- LDL  - 

— 

Variable 

Type 

Length 

Contents 

LAST 

I 

1 

location  of  last  element  of  SCM 

LENGTH 

I 

1 

length  of  SUBROUth  row  starting 

at  element  SUBROW 
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-K 


BEG  I II 

l,i\GT  - H*(I1-1>  + N + (II+l)/2  CLAST  • IJ^C  2*11-1 ) ] 

STARTR  - 2 

C CALCULATE  nOHTII  ROW  OF  L*  FACTOR  IN  H - LDL* 

FOR  ROV7  - 1 TO  H-1  , 

HAKE  UIIIT  DIAGONAL,  GAVE  RECIPROCAL 

RECIPC  r.OVl)  - 1 . /HR!  STARTR-l ) 

TR(nOV;>l,H)  - RECIPC  r.OVl)  niRCSTARTR, ID 
TI(  r.OW+ 1,11-1)  - RECIPC  ROVD'UIK  BTARTR.H-l) 

SUBTRACT  MULTIPLES  OF  ROWTII  ROW  FROM  OTHER  ROWS 

LENGTH  - N 
OTARTRS  “ START 

STARTS  " STARTR  + 11  + N-EOVl  [STARTS  ^ STARTR  + N + ID 
FOR  GUBROW  " ROW  + 1 TO  N 

Mn(  STARTS, LENGTH)  - MR(  STARTS .LENGTH) 

-TRl  SUBROW) "MR(  STARTRS .LENGTH) 

-TT(  GUBROW) «rn(  STARTRS .LENGTH) 

MI(  STARTS  + 1, LENGTH-1)  » MI( STARTS  + l .LENOTH-l ) 

. -TIH  SUBROW)  'MT(  CTARTRS  + .l , LENGTH-1 ) 

+TT(  GUBROW)  '■•I1R(  GTARTR3  + 1 , LENGTH-1 ) 

CTAHTRS  “ fJ^AHTnC  + X 

STARTS  “ STARTS  + tl  + 17+1  - GUBROW  CGTART3  “ STARTS  + N + N 
LENGTH  “ LENGTn-1 
EtlD  FOR 

MOVE  NORMALISED  ROW  FROM  TR  AND  TI  TO  MR  AND  HI 

nn(  STARTR, N)  ■>  TR(ROW+.T,n) 

MI(  STARTR, N-1)  " TRC  ROW  + .l.,N-l) 

STARTR  " STARTR  + 11  + N+l-ROW  CSTARTU  " STARTR  + 11  + T:1  + 1] 

END  FOR 

C CALCULATE  NTH  DIAGONAL  RECIPROCAL 

RECIPll)  “ 1 . /MRl  L.AST) 

nm  LA'j'i’  + l.M)  ■■  RECTin  N)"MR(  x..AGT+.l,N) 

I !i ( LAST  H , N - i ) " RE'.:  :CP(  11 ) '-MI ( LAST  + 1 , N- 1 ) 

LND 
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Cholesky  with  Square  Roots  with  Columnwise  Identity  Matrix  Augmentation 
- LL*  - F=^ 


(see  comments  under  LL*  Decomposition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

rowwi se 

HI 

FP 

N(N+l)/2  [N^] 

rowwise 

SR 

FP 

real  part  of  identity  matrix 

columnwise  (i.e.,  not  interleaved 

with  rows  of  SCM) 

SI 

FP 

n2 

imaginary  part  of  identity  matrix 

columnwise  (i.e.,  not  interleaved 

with  rows  of  SCM) 

RECIP 

FP 

N 

reciprocals  of  diagonal  elements 

of  L in  decomposition  M = LL* 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple  of  ROWth 

row  is  subtracted 

STARTR 

I 

1 

location  of  ROW+1 st  element  of 

ROWth  row 

STARTS 

I 

1 

location  of  SUBROWth  element 

of  SUBROWth  row 

STARTRS 

I 

1 

location  of  SUBROWth  elements 

of  ROWth  row 

LAST 

I 

1 

location  of  last  element  of  SCM 

LENGTH 

I 

1 

length  of  ROWth  row  starting  at 

element  ROW+1 
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Cholesky  with  Square  Roots  with  Columnwise  Identity  Matrix  Augmentation  (Cont'd) 


- LL*  - 

Type 

Variable 

Length 

Contents 

LENGTHS 

I 

1 

length  of  SUBROWth  row  starting 

at  element  SUBROW 

AUG 

I 

1 

current  column  of  identity  matrix 

STARTSV 

I 

1 

location  of  ROW+lst  element 

of  current  column  of  identity 

matrix 
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P 


CEGIll 

LAGV  - ir'(n+l)/2  CLAST  » H»t1J 
STAP.Tn  - 2 
LEIJGTH  “ U 


C CALCULATE  EOWTII  HOW  OP  L»  FACTOn  10  II  - LL" 

pon  now  - 1 TO  it- 2 

C " CALCULATE  KECIPKOCAL  DIAG0I7AL  AHD  P.OVT 

EECIPI  now)  ” l./SQRT(  HR(  STARTH-l) ) 

LErrOTIi  . LEITGTII-.). 

Mn(  STAnTlT.I.EOOTn)  " P.ECIPI  POVT)  »I!n(  START?., LEiTGTH) 
UK  START?, LEEGTI!)  " RECIPC  RO',',’) 'WIK  START?, LEKGTH) 

C » SUETRACT  MULTIPI.ES  OF  ROVTTII  PvOVJ  PROII  OTHER  ROV73 


+ 


+ 


LEIIGTIIS  " LEOGTH 
STARTRS  " STARTR 

STARTS  “ STARTR  + T.ERGTIT  CSTARTS  " STARTR  + H) 

FOR  SUBROW  “ R0'.7:-.\  TO  II- 2 

MR(  S TARTS  , L EIIGTHS ) " HR!  S TARTS  , LEIIGTIIS ) 

. -I!R(  STARTRS ) •■i’RC  STARTRS  , LEHGTHS ) 

-I  !'X(  STARTRS ) *MT.(  STARTRS  ,LBI1GT:iS ) 

UK  STARTS-!-l,LEliCTIIS-l)  = nX(  STARTS.- 1 , LEIIGTIIS ) 

-!IR(  STAnTns)*!!t(  STARTRS  i- 1. , LETOTHS-l ) 
STARTRS)  ‘•Iin(  STARTRIM- .1 , LEIIGTIIS-. I ) 
STARTRS  = STA.RTRSi-l 

STARTS  » ST.ARTS  •>  I.EIIGTIIS  CSTARTS  = STARTS  * II  -s- 
v^nriGTHS  “ LiJAGTII  -1 
nriD  FOR 


C * SUBTRACT  IITILT.TPLE  OP  ROlv'TIt  ROW  FROM  D-.IST  R0I7 


line  STARTS  , 2 ) ■■  I STARTS  , 2 ) -lUK  STARTR:; ) 

•>  . n.R(  STARTR.S  , ?. ) -!IX(  START,’.:! ! -I'lC  STARTRS  , 2 ) 

MX(  STARTr.■^I.  ) !'X(  STARTS  I)  -I  UK  STARTRS  ) ••» 

*.  Hit  STARTRS■^.^)  <l\H  STARTTIS  ) ■'’IRK  STARTRSH) 

STARTRS  ”■  CTARTI’.S  * 1 


SV:r-TiI.XCT  IIoLTXP.LM  of  ROWTI!  row  FROII  IITII  nOI7 
UR(  L.AST) 

START?  " 

ELII'.XIATE  XREIITXTir  IIATRI 


" IRII  T..'..;T)  -T’R'  ST.AR'TrS ) '■.'IlK  Si’AJ’.TRS) 

-iixi  r.TARTns ; star.tr:;') 

STARTR  > iT.’CT'.I  > .1  f.ST.IRTR  --  STARTR  + II 


S'.'.A'TT.I''  '■  r.f‘;W:-I. 

L.'"’.  .V  =’  .1  '■-■O  RR',7-.I 

*■”(  S'^.A^’-’r’l’" ^ ^ ” f-r.*;!'!;  I'n’/) --s. i;'X'^'.V‘’S'7-I ) 
f;:a  ■).) 

. f.-ii  /. 

S'Avir-.Ti'^-  •. ) -'i '•'(  ..  -ji'i'  jV’! > 

r t-.  fV.’*  ■>  T] 

rii.o  }i‘P. 

S!;' S'.;i;;T;>7-I  ) •’  i;il.,vp(  l.:.;W) 

s.ic  svART.;v,r,::':Ti  n - rx:  f;TARTR.:','T:':’^  :i-Tr:j:i‘’;  rsw) 
sx(  st.r;t.i  I, i i:  " r;.T  i tarvr, r,:  ;- 

IliO  LOR 
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C CntCUL-VVE  II- 1ST  now 

nnciinn-i)  - i./sonTdintoTAHTii-i)) 
i:n(  OTAiiTU)  ” rcciP!  :i-i  > '^nni  sTAr.Tn) 
ni(  STAKTU)  “ r.CCIl’C  Il  -l ) 'IIK  STAlITr.) 

C CALCULATE  IITII  DIA.COEAL  HECIPnOCAL 

nECIP(Il)  = X./COnT(IUl(LAST)  - nr:(  STARTS)  STAHTR) 

+ -ni(  STARTR)  --nK  STARTP.)  ) 

OTARTSV^n 

rOR  AUG-l  TO  n-l 

sin  STARTGV- 1 ) - RECTIH  II-.I. ) "fjn(  STAP.TSV-1 ) 

SK  STARTSV-l)  •’  RECIPI  11-1  > ’••SK  SVAP.TOV-l ) 

GiK  STARTEV)  “ ( SR(  STARTSV)  -SR' STARTS7-1 ) " 

+ . ninSTARTR)  -SV(  STARTSV-D'HIC  r,T.Y:->T:i)  jrtncCTPin) 

SKGTARTSV)  " ( On  STARTS V)  + SR(  STARTS'/- 1 ))  * 

+ . UK  STARVR)  -SI(  STARTSV-1 ) ’ niK  STAl’TH)  > 

+ . ''RECrCPdD 

CTARTGV'“  STARTSV+tl 
nna  ron 

SR(  STARTGV)  " RECIP(  11) 

EIID 
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Cholesky  with  Square  Roots  with  Rowwise  Identity  Matrix  Augmentation 
- LL*  ^ 

■ (sRe_COni!)entS  under  LL*  r)prf)mpn«;ition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

MNlli  , [2n2] 

dividing  the  array  into  con 

secutive  groups  of  length  N+K, 
N+K-1,  N+K-2,...,1+K[N+K],  the 
of  these  groups  contains 
the  M+l-i[N]  real  parts  of 
the  i^^  row  of  the  SCM,  fol- 
lowed by  the  real  part  of  the 
i^^  row  of  the  identity  matrix 

MI  FP  N(N+1 ) ^ |■2,q2j  analogous  to  MR,  except  it 

contains  the  imaginary  parts 

I contained  in  MR  and  MI 

reciprocals  of  diagonal  elements 
of  L in  decomposition  M = LL* 

current  row  cf  SCM 
row  from  which  multiple  of  ROWth 
row  is  subtracted 
location  of  ROW+1 st  element  of 
ROWth  row 

location  of  SUBROW th  element 
of  SUBROHth  row 


SR 

SI 


RECIP  FP  N 

ROW  I 1 

SUBROW  I 1 

START R I 1 

STARTS  I 1 
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Cholesky  with  Square  Roots  with  Rowwise 


Variable 

( 

- LL*  - 
Type 

[- -§ 

Length 

STARTRS 

I 

1 

LAST 

I 

1 

LENGTH 

I 

1 

LENGTHS 

I 

1 

Identity  Matrix  Augmentation (Cont'd) 

Contents 

location  of  SUBROWth  elements 
of  ROWth  row 

location  of  last  element  of  SCM 
length  of  ROWth  row  starting  at 
element  ROW+1 

length  of  SUBROWth  row  starting 
at  element  suSPnw 
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cnQiii 

LAST  - N*(I1-1)  + N"(H+l)/2  CLAST  - U*(  2*11-1)] 
STARTS  - 2 

CALCULATE  ROWTI!  ROV;  OF  L*  FACTOR  II]  M - LL* 
FOR  ROW  " 1 TO  n-1 

• CALCULATE  RECIPROCAL  DIAGOI7AL  ADD  ROW 

RECIPC  now)  - l./CORT(  nr.(  STARTR-1) ) 

I!i’{  fiVARTR,I7)  » REC.TP!  R0W>>-I3R(  STARTS,  11) 

UK  GTAliTR,!]-!)  “ RECIPC  ROW) '■MI(  STARTR, 17-1 ) 


GUDTRACT  MULTIPLES  OF  ROI/TII  ROW  FROM  OTHER  ROWS 


EIlO 


LREGT!!  - 17 
SVARTRO  " STARTH 

STARTS  “ STARTS  + 17  + ll-ROW  CSTARTS  “ STAP.TH  + 17  + 17] 

FOR  !;v'r,RC\7  " row  + i to  ii 

I’RC  STARTS, LL’ACTII)  " MR(  STAu;TS  , LB11C7TH) 

-IIRC  STAP-TRS ) I!'R'  STARTRS  , LERCTIl ) 

-MIC  STARTRD  ) -■l.'.yC  DTARTRS  . T.EWCTM) 

MIC  STARTS  + 7 ,LT;!7GT:!-.i.  ) ILTC  GTA.RTSU  ,LiH:CT71-l  ) 

-7’R(  STARTRG  ) '■  MK  STARTPS  + .l , L.;’.:CTH-1 ) 

< !’.K  GTARTnD>'  r!R(  GTARTRS-!-!  . LERCTII-l ) 

. STARTRS  “ D'D'.RTRD  +1 

STARTS  " STARTS  17  + 17  + .1-STJ3ROV7  CSTARTS  ">  STARTS  + N + 17  + 1) 
TJACGTil  " l.RRGTlI  -1 
i::v)  I'o.o 

STARVES  " STARTR  + 17  + 11  + l-ROW  CSTARTP.  - STAP.TR  + 11  + 17  + 1] 

FOR 


CALCUI.ATE  17T!I  DIACORAL  ELEMENT 


R7:CTP(  1 ) n I . /JinC  I AST  ) 

IIRCLAST  + .1,77)  - RRC.TPC  17)  "?!R(  lAST  + l ,!7) 
7nc  LAST  + 1 ,17-1 ) = RECIPC  1 ) ■•  rilC  LAST  + 1 ,77-1 ) 
E17D 


First  Back 

or  GE  with  L* 

Stored  Rowwise  and  S Vectorwise 

(see  comments 

under  LDL*  Decomposition) 

Variable 

Type 

Length 

Contents 

MR 

FP 

N{N+l)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  factor 

rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering 

vectors  vectorwise 

RECIP 

FP 

N 

reciprocals  of  elements  of  D 

factors 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

I 

current  row 

STARTSV 

I 

1 

location  of  first  element  of 

current  steering  vector 

STARTSVA 

I 

1 

location  of  ROW+lst  element 

of  current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 

starting  at  element  ROW+1 

START  R 

I 

1 

location  of  ROW+lst  element 

of  ROWth  row  of  SCM 
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DEGIIl 

STAn'i’SV-1 


C DO  EACH  CTEERIKG  VECTOR 

FOR  AUG  “ 1 TO  K 

C • nUDTRACT  MULTIPLES  OP  EACH  ROW  OP  M PROM  THE  STSERIEG  VECTORS 

STARTSVA  - GTARTGV  ♦ 1 
STARTR  “ 2 
LEtlGTII  - N-1 
. FOR  ROV;  “ .1  TO  N-2 

SRC  STARTSVA.LEKGTII)  ” SRC  STARTSVA, LEEGTM) 

♦ . . -SRC  STARTSVA-l ) »tIRC  GTARTR , LEEGTH) 

+ . . -SIC  STARTGVA-1 ) ''•'MIC  STARTR , IjEHGTH) 

SIC  STARTSVA, LKEGTin  - SIC  STARTSVA, LEEGTII) 

+ , . -.'■SRC  STARTSVA-.l ) ‘■MIC  STARTR, LEEGTH) 

+ . . -SIC  STAR'i’GVA-l ) 'MRC  STARTR, LEHGTH) 

. . STARTSVA  “ STARTSVA  + 1 

STARTR  “ STARTR  + LEEGTII  + 1 CSTARTR  " STARTR  + M ♦ 1] 

LEKGTII  » LEEGTH-I 
. EtlD  FOR 

. SRC  STARTSVA)  “ SRC  STARTSVA)  -SRC  STARTSVA-.l ) * 

+ . MRC  STARTR)  -SIC  STARTSVA-l ) "MIC  STARTR) 

. SIC  STARTSVA)  “ SIC  STARTSVA)  + SRC  STARTSVA-l ) '* 

+ . MIC  STARTR)  -SIC  STARTSVA-l ) "MRC  STARTR) 

C • MUI.TIPLY  DV  RECIPROCAL  DIAGOllALO 

, SRC  STARTS V,  17)  - RECIPCl,!))  + SRC  STAP.TSV.M) 

SIC  STARTSV,!!)  « RECIPCl,!))  + SIC  STARTS’/, 17) 

STARTS’/  “ STAP.TSV  + 17 
E!7D  FOR 
E17D 


First  Back  Substitution  for  LDL*  or  GE  with  L*  Stored  Rowwise  and  Componentwise 
. LOT  = S 

(see  comments  under  LDL*  Decomposition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  factor 

rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

componentwise 

SI 

FP 

KN 

imaginary  part  of  steering 

vectors  componentwise 

RECIP 

FP 

N 

reciprocals  of  elements  of  D 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row 

STARTSV 

I 

1 

location  of  beginning  of  ROWth 

components  of  steering  vectors 

SUBROW 

T 

1 

1 

row  from  which  multiple  of 

current  row  is  to  be  subtracted 

STARTSVS 

I 

1 

location  of  beginning  of  SUBROWth 

components  of  steering  vectors 

STARTR 

I 

1 

location  of  the  (ROW,  SUBROW )th 

element  of  the  SCM 
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BE(3IH 


SUBTRACT  MULTIPLES  OF  EOVfTH  COMPONEIITS  OF  STEERIKG  VECTORS 

FROM  SUCROHTII  COIIPOIIEIJTS 

STARTSV  “ 1 

STARTR  - 2 

FOR  I'.OV;  - 1 TO  II- 1 

STARTSUD  “ STAP.TOV+K 

FOR  Gunnovi  - row  + i to  n 

. . sn( GTARTOVS ,K)  “ SW S TARTS VS , K) 

+ . . -MRC  START!!  1 '•SRC  STARTSV , K ) 

♦ . . -MT(  STARTR) ' SK STARTSV, K) 

SI(  STARTS  VS,  IC)  " SK  STARTSVS  , IC ) 

+ . . -MIK  STARTR)  •■•Sn(  STAi>TSV,IO 

+ . . -I!R(  STARTR) '-GK  STARTSV, K) 

, . GTAR';’R  " STARTR  + 1 

STARTDVS  STARTGVG  + K 
RITO  FOR 

SR(  STARTGV,K)  '■  REC.ilR  ROVJ) '-SRI  STARTSV, K) 

S:C(  STARTSV, K)  “ RECUR  ROW)  "GK  STARTSV, K) 

STARTSV  " STARTSV  + K 
CSTARTR  “ STARTR  + ROVJ] 

FRO  FOR 

SRC  STARTSV,!;)  " RKC.tPC  1!)  "SRC  STARTSV, IC) 

SK  STARTS V,i;)  = RECIPC  ID  "SIC  u?ARTGV,K> 

EIID 

( ROTE  THAT  ALL  VECTOR  OPERATIonS  ARE  OF  LEKGTH 
K Aira  SO  TO  USE  THIS  ALGORITHM  K SHOoLT) 

UE  GREATER  TilAlJ  1) 
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First  Back  Substitution  for  LDL*  or  GE  with  L* 


- LOT  = S 

(see  comments 

under  LDL* 

Decomposi tion) 

Variable 

Type 

Length 

MR 

FP 

N(N+l)/2  [N^] 

MI 

FP 

N(N+l)/2  [N^] 

SR 

FP 

KN 

SI 

FP 

KN 

RECIP 

FP 

N 

T 

FP 

2N 

DOT 

FP 

1 

AUG 

I 

1 

ROW 

I 

1 

STARTS V 

I 

1 

STARTR 

I 

1 

STARTSVS 

I 

1 

Stored  Columnwise  and  S Vectorwise 


Contents 

real  part  of  L*  stored  component- 
wise 

imaginary  part  of  L*  stored 
componentwise 

real  part  of  steering  vectors 
vectorwise 

imaginary  part  of  steering 
vectors  vectorwise 
reciprocals  of  elements  of  D 
temporary  array  used  with  sum 
temporary  location 
current  steering  vector 
current  column  of  SCM 
location  of  first  element  of 
current  steering  vector 
location  of  beginning  of  current 
column  of  the  SCM 
location  of  current  element  of 
steering  vector  being  calculated 


BrGIIT 

ar/inTsv  - i 


C ELIMIWATE  EACH  STEEEIEG  VECTOR 

FOR  AUG  “ 1 TO  K 
ETARTR  " 2 

C • ELIMINATE  THE  2110  ELEMENT  OP  TRE  STEERIItG  VECTOR 

SR{  STARTEV  H)  “ CR(  STARTS V+. I ) -riRt  STARTR)  " 

+ . SR{  GTARTSV)  -riK  STARTn)  «ni(  STARTSV) 

SI(  STARTSV-M)  = CT(  ETARTGV+l ) -nR(STARTR)» 

+ . GI(  STARTSV)  +I1I(  GTARTR)  ■•■■•SR(  STARTSV) 

C " CLIMTKATE  THE  3Ra  THROUGH  HTH  ELEnEHTS  OP  THE  STEERirG  VECTORS 

STARTSVS  “ STARTGV+2 
FOR  rO'7  “ 3 TO  II 

. . T(l,no;7-.l.)  “ GR(  STARTSV,  ROV)-!  ) *I!n(  STARTR , R0V7-1 ) 

T(  R0;7,l;0V/-.l)  - S.T(  STARTSV, ROW-.l ) '■MK  GTARTR,  ROW-l ) 
nOT  “ sum  T(  .1  ,2'-l'!OV7-2)  ) 

SR(  STARTSVS)  " SR( STARTSVS)  -EOT 

T(  .l,R0V7-.l)  - - nR(STARTSV,RGV7-.l)’-llI(STAnTR,ROV7-.l) 

T(  ROW, now-.'.)  - SK  STARTSV, R0V7-.1)*IH:(  nT,AnTR,nOW-l) 

DOT  Gun(  T(  .1.  ,2''-nOW-2)  ) 

ST(  STARTSVS)  " ST(  GT.ARTDVS)  -DOT 
STARTSVS  " STARTSVSs-.l 

GTARTR  “ GTAliTl!->r.OW  CSTAUTR  " GTARTR+H3 
EHD  FOR 

SR(  STARTSV, II)  '■  SR(  STARTSV,  II) ’•rrnc.tPI  1 , ID 
SI(  STARTSV, ID  " SK  STARTSV , 11) ' rJICIPI  .1 , ID 
STARTSV  " DTARTSV+II 
EIID  FOR 
EIID 
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First  Back  Substitution  for  LDL*  or  GE  with  L*  Stored  Columnwise  and  S Componentwise 
-LDT  = S 

(This  method  is  identical  to  the  First  Back  Substitution  with  L*  stored 
rowwise  and  S componentwise,  except  in  the  calculation  of  subscripts  of  MR 
and  MI,  which  do  not  change  the  operation  counts.) 


I 


First  Back  Substitution  for  LL*  with  L*  Stored  Rowwise  and  Vectorwise 
- LT  = S - ^ 


(see  comments 

under  LDL* 

Decomposition) 

' 

Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  factor 

rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering 

vectors  vectorwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  alements 

of  L factors 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row 

STARTSV 

I 

1 

location  of  first  element  of 

current  steering  vector 

STARTS VA 

I 

1 

location  of  ROW+lst  element 

of  current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 

starting  at  element  ROW+1 

STARTR 

I 

1 

location  of  ROW+1 st  element 

of  ROWth  row  of  SCM 
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c 


c 


CEGin 

STARTSV  - 1 

DO  EACH  STEERIBO  VECTOR 
FOR  AUG  - 1 TO  K 


SUBTRACT  MULTIPLES  OF  EACH  ROW  OP  M PROM  THE  STBBRinG  VECTORS 


STARTSVA  “ STARTBV+l 
STARTR  - 2 
LEtIGTII  - M-1 

roR  Rov;  “ 1 TO  n-2 

. GR( STARTOVA-1)  " RECIPC ROVn  "SW STARTSVA-l ) 

. SI(  CTARTOVA-1)  » RECIPC ROV;) *GI(  CTARTSVA-1) 

. SR(  STARTOVArLEEGTII)  “ DR(  STARTSVA, LEBOTII) 

-BR(  GTAnTSVA-l>«I!R(  STARTH.LERGTH) 

-GK  CTARTGVA-3.)''1V>;(  S'i'ARTR.LERGTH) 

. SI(  STARTSVA.LERGTH)  “ GK  CTART3VA,LEIIGT!n 
. +SR(  STARTSVA-1)  «in(  STARTR.LEITGTH) 

-SI(  GTARTSVA-1 ) "lUH  GTARTR.LEKGTH) 

STARTSVA  " STARTSVA+L 

. STARTR  “ GTARTR  + LEIiGTII  + 1 CSTARTH  » STARTH  + N + 13 
LEI.'GTI!  - LEI7GTH-1 
EtJD  FOR 

GR(  GTARTGVA-1 ) " RECIPC  B-l ) ''SR(  STARTGVA-1) 

GK  STARTSVA- 1)  “ RRCIPC 11-1 ) "GK  nTARTSVA-.l) 

Gr.(  STARTSVA)  " CR(  GTARTOVA)  -CRl  STARTSVA-1 ) • 

tim  GTARTR ) - C K STARTSVA- 1 ) "UK  GTARTR ) 

GK  STARTSVA)  " CK  STARTCVA)  <■  SR(  GTARTSVA-1 ) " 

nj;(  GTARTR)  - CV(  GTARTSVA-1 ) ■■IlRt  GTARTR) 

Gn( GTARTGVA)  “ RECKH  H) "CR( STARTSVA) 

GK  STARTSVA)  - RKC.TPl  D)  "GTl  STARTSVA) 

STARTSV  “ STARTSV  + 11 


EIID  FOR 
EtlD 
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First  Back  Substitution  for  LL*  with  L*  Stored  Rowwise  and  Componentwise 
. LT  = S ^ ^ 


(see  comments 

under  LDL* 

Decomposition) 

Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  factor 

rowwise 

SR 

FP 

m 

real  part  of  steering  vectors 

componentwise 

SI 

FP 

m 

imaginary  part  of  steering 

vectors  componentwise 

RECIP 

FP 

n 

reciprocals  of  diagonal  elements 

of  L factor 

AUG 

I 

1 

current  steering  vector 

ROll 

I 

1 

current  row 

STARTSV 

I 

1 

location  of  beginning  of  ROWth 

components  of  steering  vectors 

row  from  which  multiple  of 

current  row  is  to  be  sub- 
tracted 

STARTSVS 

I 

1 

location  of  beginning  of  SUB- 

ROWth  components  of  steering 

vectors 

STARTR 

I 

n 

8-53 

location  of  the  (ROW,  SUBR0W)th 

element  of  the  SCM 

CEGIH 


C SUBTHACT  IIULTIPLEG  OF  nOVJTI!  COIIPOMEIITS  OP  STESHIEG 


vECTons  pnoM  gudrov;tii  compoheijts 

ETAPTSV  - 1 

ETAnyn  - 2 

FOR  nov;  " .1  TO  n-i 

STARTGVG  " .'ITARTEV+K 

. SRC  GTAP.'JGV,!:)  = REGTin  ROW)  *SR(  STAPTSV , K ) 

SIC  sviuvi'sv.ic)  " ni.GiF;  roc;)’  Si(  sTA'vrsv.K) 
FOR  Dur.now  - to  n 

Si’.C  STARTrA-S,!:)  = SRC  STAP.'i'SVS  ,'C) 

+ . . -in;c  nTAr;TR)-sRc  sTA)ri’S7,K) 

+ . . -MIC  STAr.Tn)*S:rC  S7ARTSV,K) 

SIC  GTARTGVS.IC)  SIC  S'. ARTSVS  ,K) 

+ . . < ni  c sTAR'ii: ) fi  nc  s'/AiiTs v , k ) 

+ . . -MAC  SI'/ARTR)  'RiK  S'PARTSV.K) 

START!’  “ STAR'TR  + 1 
S'TARVSVS  " STARTSVS  + E 
EIID  FOR 

STAR'TSV  “ STARTSV  + K 
CSTARTR  “ STARTR  + ROCM 
EMD  FOR 

SRC  STARTSV, K)  - REMIpc  n)  ••••SRC  STARTSV, K) 

SIC  STARTSV, IC)  = RECIPC  II) '-SIC  STARTSV, K) 

EIID 

C rOTE  THAT  ALL  VEGTOP.  OPERATIOIIS  ARE  OP 
LRESTn  K All)  SO  TO  USE  THIS  A,T.Conv'Ti:M 
!C  SHOULD  UE  CREATiU’.  THAM  1) 


B-59 


,t  Back  Substitution  for 

- 

- LT  = S - 

LL*  with  L*  Stored 

(see  comments 

un^r  LDL 

* Decomposition) 

Variable 

Type 

Length 

MR 

FP 

N(N+l)/2.  [N^] 

MI 

FP 

N(N-H)/2  [N^] 

SR 

FP 

KN 

SI 

FP 

KN 

RECIP 

FP 

N 

T 

FP 

2N 

DOT 

FP 

1 

AUG 

I 

1 

ROW 

I 

1 

STARTS V 

I 

1 

STARTR 

I 

1 

STARTSVS 

I 

1 

Columnwise  and  S Vectorwise 

Contents 

real  part  of  L*  stored  component- 
wise 

imaginary  part  of  L*  stored 
componentwise 

real  part  of  steering  vectors 
vectorwise 

imaginary  part  of  steering 
vectors  vectorwise 
reciprocals  of  elements  of  D 
temporary  array  used  with  sum 
temporary  location 
current  steering  vector 
current  row  of  SCM 
location  of  first  element  of 
current  steering  vector 
location  of  beinning  of  current 
row  of  the  SCM 

location  of  current  element  of 
steering  vector  being  calculated 


■5 
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BCGin 

CTAHTSV  - 1 

C ELIMIHATE  EACH  STEEHItiG  VECTOR 

I’OR  AUG  - 1 TO  K 

STARTR  - 2 

SR(f.TAnTSV)  - eR(  CTAnTSV)*RnCIP(  1) 

. SKSTARTSV)  - QK  STARTSV)  "RECIPC  1 ) 

C * ELIMII7ATE  THE  211D  ET.EMEIIT  OP  THE  GTESRIICG  VECTOR 

Sn(  STARTSV+1)  - Gn( STARTSV+l)  - MR(  STARTR)* 

♦.  CR(  ETART.'iV)  - I!I(  GTARTn)  "SK  STARTSV) 

SI(  STARTSV+l)  - EI(  STARTSV+1)  - MR( STARTR)* 

+ . SKSTARTSV)  + MK  STARTR)  "SR(  GTAHTSV) 

GR(  STARTSV+l)  “ GR( STARTSV+l > 'TECIPC 2 ) 

SIC  STARTSV+1)  “ SK  STARTSV+l  )*r.ECIP(  2) 

C * ELIMItJATE  THE  3nD  THROUGH  IJTH  ELEIIEHTS  OP  THE  STEERING  VECTORS 

. STARTSVS  “ STAnTSV+2 
. FOR  ROV)  " 3 TO  N 

. . T(l,nO\;-l)  - SR(  STARTSV,ROV7-l)''r!R(  STARTR, RO’, 7-1) 

T(  ROW,ROV!-1)  - nit  CTARTGV,ROV;-l)*MI(  STARTR, nOV/-l) 

DOT  - SUMt  T(  l,2*ROV7-2)  ) 

SR(  STARTSVS)  “ SRt  STA'ITGVC ) - DOT 

T(l,ROV7-l)  - -SRt  GTARTr,V,ROV7-l)"!lI<  STARTR, RO'7-1) 

T(  ROV7,nOV7-l)  - SK  nTAnTSV,ROt7-l)*tin(  STARTR, RO’.7-l) 

DOT  “ SUr.tTt  l,2*ROV7-2)) 

. . SK STARTSVS)  - SK STARTSVS)  - DOT 

STARTS  VS  ■’  G TARTS  VS + 1 

. . STARTR  “ GTARTR+ROt7  CSTARTR  - STARTR  + HI 

. . SRI  fJTAR^’SVS)  “ RECIPt  ROW)  *sn(  STARTSVS) 

SK  STARTSVS)  - RECIPt  ROV7)  "SK  STARTSVS ) 

. EllD  FOR 

STARTSV  “ STARTSV  + N 
END  FOR 
END 


B-61 


First  Back  Substitution  for  LL*  with  L*  Stored  Columnwise  and  T Componentwise 
- LT  = S-H_U 

(This  method  is  identical  to  First  Back  Substitution  with  L*  stored 
rowwise  and  S componentwise,  except  in  the  calculation  of  subscripts  of  MR 
and  MI, which  does  not  change  the  operation  counts.) 


B-62 


Second  Back  Substitution  for  LDL*  or  6E  with  L*  Stored  Rowwise  and  S Vectorwise 

- L*W  = T-  - 

^ ^ 

(see  comments  under  LDL*  Decomposition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2 

imaginary  Dart  of  L*  factor 

rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering 

vectors  vectorwise 

T 

FP 

2N 

temporary  array  used  with  sum 

DOT 

FP 

1 

temporary  location 

LAST 

I 

1 

location  of  last  element  of  SCM 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

location  of  last  element  of  current 

steering  vector 

STARTR 

I 

1 

location  of  ROW+lst  element  of 

ROWth  row  of  the  SCM 

STARTSVS 

I 

1 

location  of  ROW+lst  element 

of  current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 

starting  at  element  ROW+1 
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nnoin 

SVAnTSV  = H 

LAST  “ n-*(II+l)/2  [LA3T  " 


c ni.xniimTE  men  GTEBnins  vector 

rOR  A’ilG  “ 1 TO  K 

STAETn  « EAST  -1  CDTARTR  ■’  LAST  -HI 
C * CALCULATE  Il-.IST  COir’CEEnT 


SR(  STARTGV  -1)  ” SEC  STARTSV-l ) --I'.’f  STAP.TE)  " 

Sr.(  STARTS'/)  -I-  I'  C C STAn'.'R)  "S'K  STA^’TSV) 

5I(  STARTSV-l.)  - SK  STARTSV-l)  - nn(  STAr.'VR) 

SKSTARTSV)  - nv(  STAP.TP.) ’-SiU  START3V) 

STARTSVS  “ STARTSV-l 
LEIIGTII  “ 2 

I’OR  r.ov;  " n-2  to  i step  -x 

START;!  ^ STARTR  - j.ERCTn  -1  fSTAP-TR  " STARTR  -IT  -1] 
T(l,Lni:GTU)  '■  HRC  tJTARTR  , LEEGTII) '' 

SR!  STAj'.TS  vS  , I.EEOTil ) 

T(  LEI!OTI!+ 1 , LEECTV; ) -IM!  STARTR  , LEEGTiD 
*GI(  STARTSVS  .LSEGTII) 

DOT  “ sunt  T(  1,2 ’■'LEUGTn)  ) 

SRC  ST?.r.TSVS-l)  " Sj;(  STARTSVS-l)  - EOT 
T(  1 , J.ERGTTI)  " MR' STA.RT!!  ' ) '''SX(  fRTARTSVS  , LEi’GTH) 

TCLEKGTn-M,  i.EnGTi:)  M C' STARVE , LEEGT't) '-SRC  STARTSVS  , LEEGT!;) 
DOT  =■  suit  ( T(  1,2 EEGTri)  ) 

. DK  STARTSVa-l)  ■■  SK  STARTSVS-l ) - DOT 
STARTSVS  “ START!;VS  -1 
. IiiRDTII  " I.-EliGTll  1 
EirO  I'OR 

STARTS’/  “ STARTSV  + IT 


i:r7D  roR 


DUE 


B-64 


Second  Back  Substitution  for  LDL*  or  GE  with  L*  Stored  Rowwise  and  S Componentwise 
- L*W  = T - 

(see  comments  under  LDL*  Decomposition) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+I)/2  [N^] 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  factor 

rowwise 

SR 

FP 

KN 

real  part  of  K steering  vectors 

componentwi se 

SI 

FP 

KN 

imaginary  parts  of  K steering 

vectors  componentwise 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

beginning  location  of  ROWth 

components  of  the  steering 

vectors 

SUB ROW 

I 

1 

row  of  components  from  which 

multiple  of  ROWth  row  is  subtracted 

STARTR 

I 

1 

location  of  {SUBROW,  R0W)th 

element  of  the  SCM 

STARTSVS 

I 

1 

beginning  location  of  SUBROWth 

components  of  the  steering  vectors 
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cnGin 

STAni'SV  - K"(N-.l)  +1 

C SUDTHACT  I’ULTIPLKG  OP  EACH  COMPOIIEIIT  FROU  PREVIOUG  COriPOlinilTS 

FOR  ROVf  - n TO  ?.  STEP  -1 
aVARTR  " ROW 
GTi'.nvnvr.  - i 

ron  GijRRovj  .1.  TO  nov;  -i 

£;!■;(  GTART.CVC  ,R)  " ER' OTARTCVn  , TO 
+ . . -!!R(  START!’ ) '■  !;«(  STARTS’/ , I< ) 

+ . . +.'’.C(  f,TA!’.T!!>  ■■ST(  r:VA':TS’/,K) 

EI(  STARTSVS.IO  " ST!  STARTGVS , !0 
♦ . . -tlRC  DTARTR)  ■•■GaC  STARTS’/, K) 

♦.  . -in(  DVARTR) '''SIU  SVanTS’/.’t) 

STARTR  " STAl’.TR  II  -SU'SRO!!  CSTARTR  " STAP.TR  + 111 
. . STARTSVS  " GTARTOVS  + IC 

r.VT)  FOR 

STARTSV  " STARTSV  -K 
Ein  ron 

Eli" 

(ROTE  r.  SIIOTILD  EE  GREATER  THAU  1) 
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1 


Second  Back  Substitution  for 

LDL*  or  GE  with  L* 

Stored  Columnwise  and  S Vectorwise 

L 

-L*W  = T-. 

-q] 

(see  comments 

under  LDL* 

Decomposi tion) 

Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  columnwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  columnwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering  vectors 

vectorwise 

LAST 

I 

1 

location  of  the  last  element  of 

SCM 

AUG 

I 

1 

current  steering  vector 

STARTSV 

I 

1 

location  of  first  element  of 

current  steering  vector 

ROW 

I 

1 

current  column  of  SCM 

STARTR 

I 

1 

beginning  location  of  current 

column  of  SCM 

STARTVS 

I 

1 

location  of  ROWth  element  of 

current  steering  vector 

LENGTH 

I 

1 

length  of  current  column  of  SCM 

excluding  diagonal  element 
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DEGin 


ELIIlIimTE  EACH  STEERIKG  VECTOR 
GTAHTSV  - 1 

I.AGT  “ n'(lJ+l)/2  [LA3T  - tl»«] 

FOR  ATJG  “ 1 TO  !C 
. GTART.GV2  - STAF.TGV  + IC  -1 
. nvAHTn  " EAST  - n <•  1 
i.nrcTri  " n -i 

FOR  row  " II  TO  3 STEP  -1 

SRI  STAnTSV,I.i;i'GTII)  sni  GTAP.TSV.LErGTri) 

+ . • -SRI  STAF.TSVS  ) ’■■nsi  S VARTr , EKRGTH) 

+ . . +r.3;i  sTARTSvi;)'!!-;!  sTAr;Tr,Lni:GTn) 

SII  GTARTGV.LEIIGTin  " S TI  CTAP-TSV , LEEG'.C.T ) 

+ . . -SRI  DTArTSVS  ) •"I  IXI  STAP.V R , LEKgTH ) 

+ . . -SXI  STAF.TSVS  ) 'T;RI  STARTR  , LEEGTII) 

GTAF.TSVS  " ST;\F.TSVG-.1 
STARTR  “ STARVR  - LEKGTIi 
I.EEGVn  " EKi:G'i’II  -1 
EIID  FOR 

SRIGTARTGV)  - SP.I  STARTSV)  -SHI  STAnTSVS  ) ’» 

+ . IIP.' STARTR)  + SXI  ST.APTSVO’-IIXI  STARTR) 

. SII  STARTSV)  '•  SII  STARTSV)  - SRI  fi'TA.PTSVS ) * 

+ . r.X(  STAR'TR)  -SXI  STAR.TSVS  ) •■H.RI  STARTR) 

STARTSV  '■  STARTSV  + K 
EIIO  FOR 
EUR 
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Second  Back  Substitution  for  LDL*  or  GE  with  L*  Stored  Columnwise 
and  S Componentwise 

-L*M  . T-^0  B 

(This  method  is  identical  to  the  Second  Back  Substitution  with  L*  stored 
Rowwise  and  S Componentwise,  except  in  the  calculation  of  subscripts  for  MR 
and  MI,  which  does  not  change  the  operation  counts.) 


Second  Back  Substitution  for  LL*  with  L*  Stored  Rowwise  and  S Vectorwise 


- L*W  = T - 


(see  comments  after  LDL*  Decomposition 


) 


Variable 

Type 

Length 

MR 

FP 

N(N+1)/?  [N^] 

MI 

FP 

N(N+l)/2  [N^] 

SR 

FP 

KN 

SI 

FP 

KM 

RECIP 

FP 

N 

T 

FP 

2N 

DOT 

FP 

1 

LAST 

I 

1 

AUG 

I 

1 

ROW 

I 

1 

STARTS V 

I 

1 

STARTR 

I 

1 

STARTSVS 


Contents 

real  part  of  L*  rowwise 
imaginary  part  of  L*  rowwise 
real  part  of  steering  vectors 
vectorwise 

imaginary  part  of  steering  vectors 
vectorwise 

reciprocals  of  diagonal  elements 
of  L* 

temporary  array  used  with  SUM 

temporary  location 

location  of  last  element  of  SCM 

current  steering  vector 

current  row  of  SCM 

location  of  last  element  of  current 

steering  vector 

location  of  ROW+lst  element  of 
ROWth  row  of  the  SCM 
location  of  ROW+lst  element  of 
current  steer  nq  vector 
length  of  ROWth  row  of  SCM 


DEGItl 

STTVHTSV  - N 

LAST  - M*(ll+l)/2  CLAST  - H»N] 

C nLIMIHATE  EACH  ETEEnillG  VECTOH 

FOR  AUG  - 1 TO  K 

. STARTR  - LAST  -1  CSTARTR  » LAST  - N] 

C • CALCULATE  IITU  ELEMEUT 

smsTARTsv)  - cm  cTARTsv)'’r.ncipr  n) 

. SKSTARTSV)  - SI(  STARTSV)  "RECIPC  tl) 

C • CALCULATE  N-1  COMPOllEHT 

Sm  STARTSV-1)  " cm  BTARTCV-1)  -I1R(  STARTR)  * 

+ . CR(CTARTEV)  + I!I(  STARTR) '’SK  STARTSV) 

SI(  STARTSV-l)  ” OK  STARTSV-.l)  -I '.R(  STARTR)  <* 

♦ . SK  STARTSV)  -in( STARTR) *0n(  OTART3V) 

ER(  STARTOV-l)  - SP.(  GTAR.TOV-.l ) «RECIP(  t)-l ) 

OK  STAR.TSV-l ) - OK  OTAP.TOV- 1 ) "RECT.P(  U-1 ) 

STAR.TOVG  “ STARTGV-1 
. LBHGiTI  - 2 

FOR  r.OV7  - n-2  TO  1 STEP  -1 

STARTR  - STARTR  - LERCTH  -1  CSTARTR  “ STARTR  -11  -.13 
'Kl.LLRGTII)  " nR(  STARTR, LERGTII)'' 

+ . . Sn(  OTAI.TSVS , T.EtlGTII ) 

TCLEKGTII  + .1.  ,LEtrjTII)  " - P.K  STARTR, LEllGTH)  •* 

+ . . S K STAJ'.TSVO  , T.EIICTII ) 

DOT  " sunt  T{  1,2’-LI.T!GTH)  ) 

sr.(  ST.AR.TSVS-l.)  “ om  ETARTSVS-.).)  - DOT 

T(  l,LEI10Tn)  " 1!R.(  r.TAr.Tr.,LEi;GTTl)'>r'>T.(  0TAnTS73,T.,EllGT:i) 

KLEIIGTH  + .l.,X,EnCT!I)  " J’.K  START.R/LEl.'ST:!)  "GRI  ST.ARTSVS  ..LErPTI!) 
DOT  " GUM(T(  .l,2«LEWGTin) 

GK  DTA1!TGVS-.1.)  " GK  GTAR.TGVG  -l)-DOT 

GR.(  GTAR.TGVG-l  ) = CR.C  STARTGVG-.l ) '■REC.TPC  ROW) 

GK  GTARTOVG-.l)  " DK  CTAUTGVG-3. ) ’■'REC.XPl  ROVl) 

STARTGVG  “ GTARTGVS-1 
. . LilliOTU  “ LEMOTII  -1 

Elin  FOR 

STARTGV  " GTARTGV  + 11 
EIID  FOR 
EDO 


Second  Back  Substitution  for 

LL*  with  L* 

Stored 

Rowwise  and  S Componentwise 

-L*W  = T- 

(see  comments 

under  LDL*  Decomposition 

Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

[N^] 

real  part  of  L*  rowwise 

MI 

FP 

N{N+l)/2 

[N^] 

imaginary  part  of  L*  rowwise 

SR 

FP 

KN 

real  parts  of  K steering  vectors 

coTiiponentwise 

SI 

FP 

KN 

imaginary  parts  of  K steering 

vectors  componentwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  elements 

of  L* 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

beginning  location  of  ROWth 

components  of  the  steering  vectors 

SUBROW 

I 

1 

row  of  components  from  which 

multiple  of  ROWth  row  is  subtracted 

STARTR 

I 

1 

location  of  (SUBROW,  R0W)th  element 

of  the  SCM 

STARTS VS 

I 

1 

beginning  location  of  SUBROW^^ 

components  of  the  steering  vectors 


w a 


DEGItl 

STARTSV  - K*(U-1)+1 


SUDTP.ACT  riULTIPLES  OF  EACH  COMFOnEtlT  FROIl  PP.GVIOTJS  COMPOTIEIITS 


FOP.  nOU  " 1/  TO  2 STEP  -1 
STAPTn  - ROV; 

□TARTS vs  - 1 

sn(  GT.AP.TSV.K)  - Er(  ETARTGV.IO'T.ECIPC  now) 
SI(  STAP.TSVjK)  - ni(  STARTSV,1C)''RECIP(  ROW) 

FOR  sunnow  - i to  r.ov7-i 

SR(  START5VG,K)  - CR( GTARTSVS ,K) 

-IlRt  STARTR ) "SnC  CT.ARTSV , K ) 
+Mi(  □TAr;'rp.)''r,v(  cvARTSV.K) 

. . SI( STARTSVS.K)  " SK CTARTSVa , K ) 

-M?.(  CTARTR)'-;i;.:(  CTARTOV.K) 
-MI(  GTARTR ) • SRC  STARTSV , K ) 
STARTR  " STARTR  IJ  -SUDnOW  tSTAP.TR  ' 
STARTSVD  - STARTSVS  + IC 
EtTO  FOR 

STARTSV  - STARTSV  - K 
EHD  FOR 

R(  .l,K)  - SRC  l,K)«*rECIP(  1) 

ICl.lC)  - SIC  1,K>*RECIPC  1) 

EIJD 

C DOTE  THAT  K SnOUI,D  CE  GREATER  TRAtl  1 ) 


TAP.TR+r)] 


Second  Back  Substitution  for  LL*  with  L*  Stored  Columnwise  and  S Vectorwise 


- L*T  = S 

(see  comments 

after  LDL* 

Decomposi tion) 

Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2  [N^] 

real  part  of  L*  columnwise 

MI 

FP 

N(N+l)/2  [N^] 

imaginary  part  of  L*  columnwise 

SR 

FP 

KN 

real  part  of  steering  vectors 

vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering 

vectors  vectorwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  elements 

of  L* 

LAST 

I 

1 

location  of  the  last  element  of 

SCM 

AUG 

I 

1 

current  steering  vector 

STARTSV 

I 

1 

location  of  first  element  of 

current  steering  vector 

ROW 

I 

1 

current  column  of  SCM 

STARTR 

I 

1 

beginning  location  of  current 

column  of  SCM 

STARTSVS 

I 

1 

location  of  ROWth  element  of 

current  steerii ] vector 

LENGTH 

I 

1 

length  of  current  column  of  SCM 

excluding  diagonal  element 

B.  74 


EEGin 

C ELIIIINATC  EACH  STEERING  VECTOR 

F-TARTSV  - 1 

LAST  “ N*(tl+l)/2  CLAST  « H*N3 
rOR  AUG  - 1 TO  IC 

STARTSVS  “ STARTEV  ♦ K-1 
STARTR  - LAST  - tJ+1 
LENGTH  - N-1 

FOR  ROV;  - n TO  3 STEP  -I 

SR(  STARTSVS)  - SR(  STARTSVC  ) '■RECIPC  Rom 
SK  STARTSVS)  " SK  GTARTCVG  ) '^R!;CIP(  ROV)) 

Sn(  STARTSV, LENGTH)  - SRC  STARTS V, LENGTH) 

♦ . . -GR;  STARTSVS )'  nn(  STARTA, LENGTH) 

+ . . +EI(  SVARTSVG)«I1I(  STARTR, LENGTH) 

. . SK STARTSV, LENGTH)  " SK STARTSV, LENGTH) 

+ . . -GR(  GVARTSVS  ) '-riK  STARTR , LENGTH ) 

♦ . . -SK  STARTSVS ) 'HRC  STARTR, LENGTH) 

STARTSVS  " STARTSVS  -X 
STARTR  - STARTR  -LENGTH 

LENGTH  - LENGTH  -1 

END  FOR 

C • CALCULATE  ZIJD  ELEMENT  OP  STEERING  VECTOR 

. Sn(  r.TARTGVS  ) - SR(  STARTSVS  )"rECIP(  2) 

SK  STARTSVS  ) - SK  GT,ARTCVG  ) “RECIPC  2 ) 

C • CALCULATE  ICT  ELEMENT  OF  STEERING  VECTOR 

. SR(  STARTSV)  - SRC  STARTSV)  -SRC  STARTSVS  ) ■» 

+ . MR( STARTR)  + SK STARTSVS ) HIK  STARTR) 

SK  STARTSV)  - SK  JiTAR'.' SV)  -SR{  STARTSVS  ) « 

+ . UK  D VARTi! ) -S K STARTS  v S ) '■inH  START  .1 ) 

Sn(  STARTSV)  “ SR(  STARTSV)  "RnCKM  1) 

SK  STARTSV)  - SK  STARTSV ) "RECIPt  .1 ) 

STARTSV  " STARTSV+K 
NI")  FOR 
EIIU 


Second  Back  Substitution  for  LL*  with  L*  Stored  Columnwise 


and  S Componentwise 


-L*T  = S-'^ 


(This  method  is  identical  to  the  Second  Back  Substitution  with  L* 
stored  Rowwise  and  ? Componentwise,  except  in  the  calculation  of  subscripts 
of  MR  and  MI,  which  does  not  affect  the  operation  counts.) 


Appendix  C.  CDC  STAR- 100  Software 


SUMMARY  AND  DOCUMENTATION 

The  subroutine  CHOLDC  factors  the  positive  definite  Hermi ti an  matrix  M 
★ 

into  its  LL  decomposition  using  the  Cholesky  algorithm.  The  factor  L is 
stored  in  place  of  the  input  matrix  M.  Subroutine  BAKSUB  takes  this  lower 
triangular  matrix  L and  a vector  S as  input  and  solves  the  system  MW=S  for 
W by  performing  two  back  substitutions. 

CHOLDC 

Algorithm:  Since  M is  positive  definite  Hermitian  it  has  a factoriza- 
★ 

tion  of  the  form  LL  where  L is  a lower  triangular  matrix  with  real  positive 

* 

diagonal  entries  and  L is  its  conjugate  transpose,  as  discussed  in  the  section 
on  mathematical  techniques.  If  N is  the  dimension  of  M,  M = {m^.j}  and 
L = {Jl.  ■}  we  have 


0 

where  the  null  sum  ^is  taken  to  be  0 and  the  square  root  is  positive. 
k=l 

Using  these  equations,  one  can  successively  solve  for  the  first  column 
of  L,  £22*  second  column  of  L the  k^^  column  of  L,...,  and 


C-1 


INPUTS: 


NAME 

TYPE 

COIiMON 

CONTENTS 

MATDIM 

I 

blank 

Dimension  of  the  matrix  <200 

REEL(20100) 

R 

blank 

Real  part  of  the  lower  half  of  matrix  M 

R R R R R 

stored  columnwise:  m-i-.  .m^,, ,. . .m^,  ,m55,mo,, 
R R R 'R  R 

"’N2’  • • • ’’"kk*  k+1  k’  • • • ’""nK’  • • •"’nn* 

R / r 

where  m. . = m. . + jc.mi  ■ 
ij  ij  iJ 

0MAG(20100) 

R 

blank 

Imaginary  part  of  the  lower  half  of 

the  matrix  M stored  the  same  way  as  REEL. 

OUTPUTS: 

NAME 

TYPE 

COMMON 

CONTENTS 

REEL (201 00) 

I 

blank 

Real  part  of  factor  matrix  L stored 

columnwise  as  above. 

OMAG(20100) 

I 

blank 

Imaginary  part  of  factor  matrix  L stored 

as  above. 

RECIP(200) 

I 

blank 

Reciprocals  of  diagonal  elements  of  L 

(all  positive  real  numbers  by  Equation  (l ) ). 
Comments  (letters  refer  to  labels  in  listing): 

A.  This  routine  was  designed  to  work  in  a system  with  the  following 
characteristics . 

1)  MW=S  to  be  solved  for  arbitrarily  many  different  S vectors  for 
a given  M matrix. 

2)  Frequent  changes  of  matrix  M (speed  a paramount  "actor). 

3)  Infrequent  changes  of  dimension. 

The  vague  words  "arbitrarily  many",  "frequent"  and  "infrequent"  are  used  since 
they  are  a function  of  each  system  and  may  vary  considerably.  Our  goal, 
however,  is  to  find  upper  bounds  in  speed. 


Since  the  changes  in  dimension  are  infrequent,  we  chose  to  compute 

the  most  frequently  used  vector  descriptors  once  for  each  new  dimension. 
MATOLD  contains  the  old  dimension  and  if  it  disagrees  with  the  new  one 


in  MATDIM,  new  descriptors  are  calculated.  Not  all  descriptors  are  calculated: 
temporary  ones  are  assigned  in  lines  91  and  93  (numbers  on  left-hand  side  of 
page)  of  CHOLDC,  since  each  one  is  only  referenced  once  in  a call  to  CHOLDC 
and  there  are  almost  as  many  of  them  as  elements  in  the  matrix  ((MATDIM-2)* 
(MATDIM-1)  versus  MATDIM* (MATDIfH-l ) , so  storing  them  all  would  require  a 
great  deal  of  storage. 

It  is  possible  to  attain  a slight  increase  in  speed  in  this  section  of 
code  (at  the  expense  of  readability)  by  making  the  calculation  of  MATOLD, 

MTDI^l , MTDMM2,  MTDMPl,  and  MTDMP2  a single  vector  addition  and  equivalencing 
the  individual  variable  names  to  different  elements  of  the  vector.  This  may 
seem  like  a small  point,  but  it  shows  how  concerned  vie  are  about  doing  every- 
thing possible  to  speed  up  the  program. 

The  following  descriptor  arrays  are  calculated  for  J=1 ,2, . . . ,MATDIM-1 : 

CONTENTS 

Points  to  jth  column  of  real  part  of  matrix  M (and 
later  L)  stored  in  array  REEL. 

Points  to  column  of  imaginary  part  of  M and  L 
in  array  JfiAG. 

Points  to  jth  column  excluding  diagonal  elements  of 
real  part  of  matrices  M and  L. 

Points  to  column  excluding  diagonal  element  of 
imaginary  part  of  M and  L. 


NAME 

DREEL(J) 

DJMAG(J) 

DREELl(J) 

DJf^Gl  (J) 
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NAME 

CONTENTS 

DTEMPR(J) 

Points  to  the  real  part  of  the  M vector  in  array 

TEMPR  from  the  J+1^^  element  to  the  end. 

DTEMPJ(J) 

Points  to  the  imaginary  part  of  the  U vector 

in  array  TEMPO  from  the  J+1^^  element  to  the  end 

DDTTMP(J) 

Points  to  the  first  MATDIM-J  elements  of  the 

temporary  array  DOTTMP. 

The  MATDIM-1^^  elements  of  these  last  four  arrays  are  never  used  but 
are  calculated  to  avoid  an  IF  statement  in  the  loop  or  another  DO  loop,  under 
the  assumption  these  alternatives  would  take  longer. 

These  last  four  arrays  are  used  primarily  in  subroutine  BAKSUB  but  since 
BAKSUB  may  be  called  more  than  once  (for  different  values  of  the  S vector) 
for  one  call  to  CHOLDC,  the  calculation  was  done  here  to  avoid  unnecessary 
checking  for  changing  dimensions,  so  that  the  instruction  stack  is  not 
changing  unnecessarily  often.  In  the  final  implementation  this  function 
of  calculating  descriptors  would  probably  be  done  by  some  executive  routine. 

B.  Since  only  the  reciprocals  of  diagonal  elements  of  the  L matrix 
are  needed  to  both  calculate  the  columns  of  the  L matrix  and  perform  the 
back  substitutions,  they  are  stored  in  array  RECIP. 

We  assumed  in  coding  that  the  compiler  would  optimize  constant  subscripts 
to  be  equivalent  in  speed  to  using  equivalenced  nonsubscripted  variables. 

If  this  is  not  so,  we  would  equivalence  nonsubscripted  variables  to  REEL(l), 
RECIP(l),  DREEL(l),  DJI'IAG(l)  and  to  similar  references  elsewhere  and  use  them 
instead. 

Since  we  know  the  diagonal  elements  of  L are  real  and  only  use  their 
reciprocals  anyway,  it  is  unnecessary  to  calculate  them  and  so  we  could  just 
use  DREELl(l)  and  DJMAGl(l)  here,  shortening  two  vector  multiplies  by  one 
element  each. 
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C,  The  vector  references  DREEL(JCOL)  and  DJMAG(JCOL)  are  used  in 


the  loop  described  in  section  D,  so  we  stored  them  in  nonsubscripted 
variables  DTR  and  DTJ  in  the  hopes  of  making  their  use  more  efficient  in 
the  loop.  This  may,  however,  be  unnecessary  since  the  STAR  has  255  registers, 
which,  if  used  efficiently,  could  hold  these  descriptors  for  the  duration 
of  the  loop.  In  other  words,  the  compiler,  upon  first  coming  across  the 
descriptor,  would  set  aside  a register  for  it  for  the  duration  of  the  loop, 
making  the  allocation  of  an  extra  memory  location  pointless.  Similarly, 
the  variables  IFIRST,  LEN,  JCOLMl , MATDIM  and  the  ones  depending  on  MATDIM 
should  be  kept  in  registers. 


D.  The  four  values  restored  in  scalars  in  the  beginning  of  the  loop 
were  restored  for  the  same  reason  as  DREEL(JCOL)  and  OJAG(JCOL)  in  part  C 
and  the  same  comments  apply  here. 

We  know  the  imaginary  part  of  the  diagonal  elements  of  L are  0,  yet 
we  calculate  them  here  to  avoid  extra  vector  references.  The  alternative  is: 

Part  C:  JFIRST=1 

LENM1=MTDMM1 
JCOLMl =0 

DO  3 JC0L=2,MTDMM1 

I FI RST= I F I RST+MTDMP2- JCOL 
ASSIGN  DTR,  DREEL(JCOL) 

ASSIGN  DTJ,  DJMAGl(JCOL) 

LEN=LENM1 
LENM1=LENM1-1 
ISUB=JC0L 
JCOLMl =JC0LM1+1 


Part  D:  DO  4 SUBC0L=1 , JCOLMl 

TR=REEL(ISUB) 

TJ=JMAG(ISUB) 

ISUBP1=ISUB+1 

DTR=DTR+REEL ( I SUB ; LEN )* ( -TR ) 
+JMAG(ISUB;LEN)*(-TJ) 
DTJ=DTJ+REEL(1SUBP1 ;LENM1 )*(TJ) 
+JMAG(ISUBP1 ;LENM1 )+(-TR) 
ISUB=ISUB+MATDIM-SUBCOL 
4 CONTINUE 
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Calling  this  Method  2 to  contrast  it  to  Method  1 in  the  listing,  we 
have  the  following  table  of  "extra  operations  performed": 


j 

Method  1 

Method  2 

■ 

1 scalar  assignment 

Part  D 

Increase  in  length  by  1 of  2 

1 scalar  addition 

vector  multiplies  and  2 vector  adds 

2 vector  references 

An  intimate  knowledge  of  the  compiler  is  necessary  to  choose  between 
methods. 


Another  possible  change  in  this  part  is  the  use  of  explicit  temporaries  in 
the  two  vector  assignment  statements  instead  of  letting  the  compiler  allocate 
temporary  space  for  the  intermediate  results  of  vector  multiplications  and 
additions.  If  a great  deal  of  overhead  is  involved  to  allocate  and  free  up 
space  each  time  a temporary  is  needed,  it  might  be  more  efficient  to  have 
some  extra  arrays  like  DOTTMP  and  do  the  allocation  "manually." 

E.  As  in  part  B we  need  not  calculate  the  diagonal  element  of  L,  only 
its  reciprocal,  making  it  possible  to  shorten  the  length  of  the  two  vector 
assignment  statements  at  the  expense  of  having  one  and  possibly  two  extra 
array  references  (DREELl  (JCOL)  and  possibly  DJflAGl  (JCOL))  instead  of  using 
the  ones  already  available  in  DTR  and  DTJ. 

F.  There  are  two  ways  to  perform  the  sum  here,  using  the  loop  and 
subscript  calculation  as  shown,  or  using  double  descriptors  and  the  library 
function  Q8SSUM,  which  has  the  added  advantage  of  performing  the  sum  in 
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double  precision.  The  double  descriptors  would  require  a bit  vector  of 
length  MATDIM*(MATDIM+1 )/2  bits,  and  two  vector  multiplies  (to  calculate 
the  sums  of  the  squares  of  the  real  and  imaginary  parts)  using  it. 

There  are  many  different  ways  of  summing  the  resulting  squares.  The 
two  methods  which  come  to  mind  immediately  are 

I 

1)  adding  the  vectors  of  squares  and  using  Q8SSUM  on  the 
sum  vector,  and 

2)  using  Q8SSUM  twice  on  the  vectors  separately  and  adding 
the  two  scalar  results. 

Assuming  that  we  do  not  need  the  double-precision  results  of  Q8SSUM, 
we  need  only  consider  timing  to  choose  algorithms. 

Since  the  timing  curve  for  vector  additions  as  a function  of  vector 
length  is  a straight  line  not  passing  through  the  origin  (because  of  the 
start  up  time)  and  the  curve  for  Q8SSUM  is  probably  similar,  it  may  be 
that  no  one  method  may  be  best  for  all  vector  lengths.  Some  variant  of  the 
following  method  may  be  best  (where  the  N elements  of  vector  X are  to  be 
summed): 

1)  partition  X into  j consecutive  subvectors,  each  of  length 

n,^,k=l , . . . , j;  where  n,  = max  n. 

' l<k<j 

2)  using  vector  addition,  add  the  j-1  subvectors  to  the  first 
one,  resulting  in  a vector  of  length  n^<N,the  sum  of 
whose  elements  is  the  same  as  the  sum  of  the  elements 
of  the  original  vector. 


C-7 


3)  repeat  steps  1)  and  2)  on  the  resulting  shorter  vector 
until  one  is  obtained  that  is  of  the  best  length  for 
summing  by  Q8SSUM. 

There  are  obviously  a lot  of  variables  in  the  above  algorithm.  One 
specific  example  is  to  take  the  second  half  of  the  vector  X and  add  it  to 
the  first  half,  repeating  the  bisection  process  and  finally  using  Q8SSUM 
on  the  result.  It  requires  an  excellent  knowledge  of  the  timing  of  both 
Q8SSUM  and  vector  addition  as  a function  of  vector  length  to  optimize  this 
algorithm. 


BAKSUB: 


Algorithm:  Given  the  LL  factorization  of  the  matrix  M,we  can  solve 
the  system  MW=l(L  W)=S  for  W by  solving  the  two  triangular  systems 

LT  = S 
★ 

L W = T 


by  back  substitutions 


where  the  null  sums^and  are  taken  to  be  0. 

J=1  J®N+1 

We  successively  compute 

Inputs:  In  addition  to  the  output  from  CHOLDC,  BAKSUB  uses: 


NAME 

TYPE 

COMMON 

EREAL(200) 

R 

blank 

EMAG(200) 

R 

blank 

CONTENTS 

Real  part  of  vector  S 
Imaginary  part  of  vector  S 


Outputs: 


NAME 

TYPE 

COMMON 

TEMPR(200) 

R 

work 

TEMPO (200) 

R 

work 

CONTENTS 

Real  part  of  vector  W 
Imaginary  part  vector  W 
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COMMENTS 

A.  As  in  CHOLDC,  the  efficiency  of  this  code  depends  on  the  compiler's 
treatment  of  constant  array  subscripts  and  vector  temporaries. 

B.  Here  we  question  whether  the  compiler  saves  XMUL  in  a register  to 
use  for  both  multiplies  and  whether  it  knows  only  to  do  one  addition 
to  address  the  array  elements  on  either  side  of  the  equal  sign. 

C.  Again  we  are  concerned  about  the  compiler's  use  of  registers, 
vector  temporaries,  and  identical  array  references  on  either 
side  of  the  equal  sign. 

D.  There  are  two  alternate  ways  of  calculating  XMUL: 

XMUL  = RECIP(MATDIM)*RECIP(MATDIM) 
and 

XMUL  = RECIP(MATDIM) 

XMUL  = XMUL*XMUL  • 

And  to  choose  among  the  three  methods  requires  a knowledge  of  how 
the  compiler  handles  integer  exponentiation  and  identical  array 
references  in  one  expression. 

Also,  if  TEMPR(MTDMM1 ) and  TEMPJ(MTDMM1 ) were  kept  in  registers, 
there  would  be  no  need  to  store  them  in  extra  nonsubsc  ipted 
variables  TRl  and  TJl  for  later  use  in  lines  66,  67,  80,  and  81. 
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Variables  RS  and  JS  are  in  COMMON/WORiy and  set  equal  to  the  next- 
to-last  entries  in  REEL  and  JMAG, 'respectively , since  these  are 
calculated  in  CHOLDC  and  not  changed  in  BAKSUB. 


E.  The  first  backsubstitution  above  used  vector  arithmetic  since 
the  coefficient  matrix  L was  stored  columnwise.  In  the  second 

ir 

back  substitution,  the  coefficient  matrix,  L , is  stored  row- 
wise, so  we  must  use  dot-products.  In  other  words,  in  Equation 

OX  once  t-  is  calculated,ji.  .t.  is  subtracted  from  s.  for  i=j+l 
J 1 J J 1 

through  N inline  vector  operation;, whereas,  in  Equation  (4),  the 

I * ( 

indicated  dot-product  of  and  <w. 

» )j=i+l  ( J 

performed.  The  efficiency  of  this  code  depends,  as  before,  on 


In 

> is  actually 
!j=i+l 


1)  the  speed  of  precalculated  descriptors  versus  vector 
references , 

2)  implementation  of  vector  temporaries  such  as  DXRl, 
DXJl,  DTR,  DTJ,  and  DDOT,  and 

3)  speed  of  Q8SSUM, 

all  of  which  have  been  discussed  before. 

Finally,  although  this  was  not  done  in  either  routine,  it  must  be 
remembered  that  vector  addition  is  faster  than  just  vector  assignment, 
so  that  to  copy  vector  B into  vector  A,  it  is  faster  to  say 


than 


A = B + 0 

A = B . 
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Figure  C-l  Timing  and  Accuracy  Results  for  Case  MATDIM  = 200 


Dimension  of  matrix  = 200 

Time  for  descriptor  calculation  = 2.317  msec 

Time  for  decomposition  = 1400.990  msec 


0/ 

Error  analysis  comparing  L (actual  factor  matrix)  to  L (calculated  factor 
matrix)  (all  errors  absolute) 


Location  of 

Maximum  Error  Maximum  Error  Average  Error 


Real  part 
Imaginary  part 
Absolute  value 


.213E-6 

.252E-6 

.294E-6 


20061  .734E-9 

20062  .694E-9 

20062  .112E-8 


Time  for  first  backsubstitution  = 18.273  msec 
Time  for  second  backsubstitution  = 27.974  msec 


Error  analysis  comparing  W (actual  solution)  to  ft"  (calculated  solution) 


(all  errors  absolute) 

Maximum  Error 

Location  of 
Maximum  Error 

Average  Error 

Real  part 

.512E-5 

11 

.278E-6 

Imaginary  part 

.287E-5 

19 

.283E-6 

Absolute  value 

.574E-5 

11 

.430E-6 
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SUBROUTINE  COVCMP 


2 C UPDATE  SAMPLE  COVARIANCE  MATRIX 


3 

4 

5 

6 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 
2b 
26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

42 


corjioii  /wcTiim/  wsETUp.vroncMP.raKSBi ,;',’bksb2,wcvcmp 
coi'j'.ON  /cov/  covr.(  20100)  ,covj(  20100) 

COMMON  /TIME/  TSETUP , TDECMP , TBKSBl , TD::3B2 , TCVCMP 
COlUIOM  /TEMP/  TDOT(  2 00)  .SMPRC  200) ,SnPJ(  200)  , DTDOT , SRDOT , SIDOT , 
•WRDOT  , WIDOT  .KIIRMR  , WNRMI , SNP.O 
DESCRIPTOR  DXDOT,  SRDOT,  SIDOT,WRDOT,V;iDOT,KNRMR,V/NRMI 
COMMON  MATDIM,nEEL(  20100) ,JMAG( 20100) , EREAL(  200) , EMAG(  200) 

REAL  JMAG 

DESCRIPTOR  DRRl , DREEL  , DJI4AG  , DSMPR , DSMPJ 
DIMENSION  RRK  200) 

INTEGER  W1,W2 
CALL  Q8CLOCK( , ,W1) 

Tl-SECOND(  X) 

J-1 

K-MATDIM 

DO  100  I-1,MATDIM 

ASSIGN  DRRl, RRK  IjK) 

ASSIGN  DREEL, COVR{  J:K) 

ASSIGN  D JMAG , COVJ(  J ) K ) 

ASSIGN  DSMPR,SMPR(  I jK) 

ASSIGN  DSMPJ,SMPJ(  I|K) 

SR-SMPR(  I) 

SJ-SMPJ(  I) 

DRR1“DSMPR*SR 

DREEL-DREEL+DRRl 

DRR1-DSMPJ*SJ 

DREEL-DREEL+DRRl 

DRRl-DSMPJ+SR 

DJMAG-DJMAG-DRRl 

DRR1-DSMPR*SJ 

DJMAG -D JMAG+DRRl 

J-J+K 

K-K-1 

100  CONTINUE 

T2“SECOND( X) 

CALL  Q8CLOCK{  , ,W2 ) 

v)cvcMP-(  m -m ) / 1000. 

TCVCMP-(  T2-T1) *1000. 

RETURN 

END 


SUBROUTINE  CHOLDC 


PERFOni'l  CilOLESKY  LL*  DECOMPOSITION  OP  A POSITIVE  DEFINITE 
HERI-IITIAK  MATRIX 


COl'MON  /BLOWUP/  NEG 

COMMON  MATDIM,REEL(  20100) ,JMAG(  20100) , EREAL(  200) , EMAG(  200) 
REAL  JMAG 

COMMON  /WORK/  MATOLD  .MTDMMl , MTDm2  , MTDMP 1 ,MTDMP2  , SlZEMl , 

* DEREAL, DEMAG, RECIP(  200) ,DREEL(  200)  ,DREEL1(  200)  ,DJMAG(  200) , 

* DJMAGK  200) ,TEnPR(  200) ,TEMPJ(  200) ,DTEMPR(  200) .DTEMPJC  200) , 

* DOTTMPC  200) ,DDTTMP(  200)  ,RS , JS 
REAL  JS 

INTEGER  SlZEMl 

DESCRIPTOR  DREELl , D JMAG 1 , DTEMPR , DTEMP J , DDTTMP , DEREAL , DEMAG 
INTEGER  SUBCOL,ESUB 

DESCRIPTOR  DREEL , D JMAG , DXR , DX J , DTR , DTJ 
DESCRIPTOR  DTTR.DTTJ 

COmON  /TIME/  TSETUP,TDECMP,TBKSB1,TBKSB2,TCVCMP 
COMMON  /WCTIME/  V/SETUP  , WDECMP  , WBKSBl , WBKSB2  , WCVCMP 
CALL  QSCLOCKC  , , IWl ) 

Tl-SECOND{  X) 


65  C 

66  C 

67  C 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 
81 
82 

83 

84 

85 

86 

87 

88 

89 

90 

91 

92  2 

93 

94 


SET  UP  ARRAY  DESCRIPTORS 


IF  ( MATDIM.EQ. MATOLD)  GOTO  1 
MATOLD-MATDIM 
MTDMf<Il  -MATDIM- 1 
MTDMJ12-MATDIM-2 
MTDMPl-t4ATDIM+l 
MTDMP2-MATDIM+2 
ASSIGN  DEREAL , EREALC  2 :MTDMM1 ) 

ASSIGN  DEMAG, EMAG(  2 ; MTDMMl) 

ISUB“1 

LENMl “MATDIM 
LEN=MATDIM 
DO  2 J-1, MTDMMl 
LENMl "LENMl -1 
ISUBPl-ISUB+1 
JPl-J+1 

ASSIGN  DREEL(  J) , REEL(  ISUB:LEN) 
ASSIGN  DJMAG(  J) , JMAG(  ISUBsLEN) 
ASSIGN  DDTTMP(  J)  ,DOTTMP(  1 ; LENMl ) 
ASSIGN  DTEKPRC  J)  , TEMPR(  JPl :LENM1) 
ASSIGN  DTEMPJC  J ) , TEHPJ(  JPl ; LENMl ) 
ASSIGN  DREELK  J)  ,REEL(  ISUBPl ( LENMl ) 
ASSIGN  DJMAGK  J)  ,JMAG(  ISUBPl ; LENMl ) 
IGUB"ISUB+MTDMP1-J 
I,EN“LENM1 
CONTINUE 
SISEH1"ISUB-1 
T2°SECOND(  X) 


43 


44  C 

45  C 

46  C 

47  C 


48 

49 

50 

51 


55 

56 

57 

58 

59 

60 
61 
62 

63 

64 
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* 


95 

96 

97 

98 

99 

100 

101 

102 

103 

104 

105 

106 

107 

108 

109 

110 
111 
112 

113 

114 

115 

116 

117 

118 

119 

120 

121 

122 

123 

124 

125 

126 

127 

128 

129 

130 

131 

132 

133 

134 

135 

136 

137 

138 

139 

140 

141 

142 


500 

10 


C 

C 

C 


C 

C 

C 


c 

c 

c 


CALL  O8CL0CK(  , , IW2 ) 
TSETUP-(  T2-T1 ) *1000. 
WSETUP-< IW2-IW1)/1000. 
RETURN 
CONTINUE 


CALCULATE  RECIPROCAL  OF  FIRST  DIAGONAL  ELEMENT 


NEG-0 

IF  ( REEL( 1) .GT.0.0)  GOTO  10 
NONE-1 

WRITE  (6,500)  NONE,REEL(l) 

F0R1IAT(  16H0ERROn  IN  CHOLDC , 14 , 18H  DIAGONAL  ELEMENT- , El 2 . 5 ) 
NEG-1 
RETURN 
CONTINUE 

XTEMP-1./SQRT{  REEL(  1)) 

RECIPC 1)-XTEMP 


CALCULATE  FIRST  COLUMN 


DREELC 1 ) -DREEL(  1 ) ‘XTEMP 
DJMAG{  1 > -DJMAGC 1 ) •XTEMP 


CALCULATE  COLUMNS  2 THROUGH  MATDIM-1 


IPIRST-1 

LEN-MATDIM 

JCOLMl-0 

DO  3 JCOL-2 .MTDMMl 

IFIRST-IFinST+MTDMP2-JCOL 

ASSIGN  DTR,DREEL{  JCOL) 

ASSIGN  DT J , DJMAG( JCOL ) 

LEN-LEN-1 

ISUB-JCOL 

JCOLMl-JCOLMl+1 


SUBTRACT  MULTIPLES  OF  PREVIOUS  COLUMNS 


DO  4 SUBCOL-1 , JCOLMl 
Tn-REEL(  ISUB) 

ASSIGN  DTTR,REEL(  ISUBiLEN) 
TJ“JMAG(  ISUB) 

ASSIGN  DTTJ, JILAGC ISUB;LEN) 
DTR-DTR+DTTn*(  -TR)+DTTJ»(  -TJ) 
DTJ-DTJ+DTTP.*(  TJ)+DTTJ*(  -TR) 
ISUB-ISUB+MATDIM-SUBCOL 
CONTINUE 
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CALCULATE  RECIPHOCAL  OF  DIAGONAL  ELEMENT 


IP  ( REEL(  IFinST)  .GT.0.U)  GOTO  11 
WRITE  (6,500)  JCOL,RnEL( IFIRST) 
NEG-l 
RETURN 
CONTINUE 

XTEHP-1./SQRT(  RnEL(  IFIRST) ) 

REC1P( JCOL) -XTEMP 


CALCULATE  COLUMN 


DTR>DTR*XTEMP 
DTJ-DTJ “XTEMP 
CONTINUE 


CALCULATE  LAST  DIAGONAL  ELEMENT 


SUM-0. 

ISUB-llATDIM 
DO  7 SUBCOL-l.MTDMMl 
XT-REEL( ISUB) 

XI-JILAG(  ISUB) 
SUM-SUM+XT“XT+XI*XI 
ISUB-ISUB+MATDIM-SUBCOL 
CONTINUE 

ASUM-REEL(  ISUB) -SUM 
IF  ( ASUM.GT.0.0)  GOTO  12 
WRITE  (6,500)  MATDIM,ASUM 
NEG'*1 
RETURN 
CONTINUE 

RECIP( MATDIM) -1 . /SQRT(  ASUM) 
RS-REEL(  SIZEMl) 

JG-JMAG( SI2EM1) 

T2-SECOND(  X) 

CALL  QOCLOCK(  , , IW2  ) 

TDECMP-(  T2-T1)*1000. 

WDnCIlp-(  IW2  - IWl ) / 1000 . 

RETURN 

END 


185 


SUBROUTINE  BAKSUB 


186  C 

187  C GIVEN  THE  LL*  DECOMPOSITION  OF  A MATRIX,  AND  A VECTOR  S,  PERFORM 

188  C TWO  BACK  SUBSTITUTIONS  TO  SOLVE  (LL*)W-S  FOR  W 

189  C 


190 

191 

192 

193 


197 

198 

199 

200 
201 
202 

203 

204 


COmiON  /KCTIME/  V;SETUP  ,VroECMP  ,VfflKSBl  ,t'raKSB2  ,WCVCMP 
COMMON  /TIME/  TSETUP , TDECMP , TBKSBl , TBKSB2 , TCVCMP 
COr-MON  MATDIM,nEnL(  20100)  , JRAGI  20100)  ,EP.EAL(  200)  , EMAG{  200) 
COMMON  /V70RK/  MATOLD  .MTDRUU  ,MTDMM2  , MTDMPl , MTDMP2  , SIZEMl  , 

* DEnEAL,DERAG,nECIP(  200)  ,DnEEL(  200) ,DREEL1(  200) .DJMAGC  200) , 

* DJMAGIC  200) ,TEMPR( 200) ,TEMPJ(  200) ,DTEMPH(  200) ,DTEMPJ(  200) , 

* DOTTMPC  200) ,DDTTMP(  200) ,ns, JS 
REAL  JS 

INTEGER  SIZEMl 
DESCRIPTOR  DREEL,DJMAG 
REAL  JMAG 

DESCRIPTOR  DREELl ,DJMAG1 ,DTEMPR,DTEMPJ, DDTTMP.DEREAL, DEMAG 
DESCRIPTOR  DXRl ,DXJ1 ,DTR,DTJ ,DDOT 
CALL  Q8CLOCK( , , IWl ) 

Tl-SECOND(  X) 


205 

206 

207 

208 

209 

210 
211 
212 

213 

214 

215 

216 

217 

218 

219 

220 

221 


C 

C 

C 

c 

c 


PERFORM  FIRST  BACK  SUBSTITUTION,  (L)TEMP-3 
CALCULATE  FIRST  ELEMENT  OF  TEMP 


XMUL“RECIP(  1) 

TEMPR(  1 ) -EREAL(  1 ) *XMUL 
TEMPJC 1 ) -EMAGC 1 ) *XMUL 


SUBTRACT  MULTIPLE  OF  FIRST  COLUMN  OF  L PROM  3 AND  STORE  IN  TEMP 


DTEMPR(  1 ) -DEREAL  + ( -TEMPR(  1 ) ) *DREEL1(  1 ) +( TEMPJ(  1 ) ) *DJMAG1(  1 ) 
DTEMPJ(  l)"DEMAG  + ( -TEMPJI  1)  ) ‘DREELK  1 ) +(  -TEMPR(  1)  ) •DJ14AG1(  1) 


C 

C 

c 


CALCULATE  ELEMENTS  2 THROUGH  MATDIM-2 


DO  1 JCOL-2  ,MTDmi2 


222  C 

223  C CALCULATE  JCOL-TII  ELEMENT 

224  C 


225  XMUL-RECIP(  JCOL) 

2 2 6 TKMPRC  JCOL ) -TEMPRC  JCOL ) *X 

227  TEMPJ(  JCOL)-TEMPJ(  jqOL)*X 


228  C 

229  C SUBTRACT  MULTIPLE  OF  JCOI,-TH  COLUMN  FROM  TEMP 


C-17 


2za 

231 

232 

233 

234 

235 

236 

237 


AssiGtJ  DXRi  ,Dnnr:Li(  JCOL) 

ASSIGN  DXJl .DJMAGK JCOL) 

TR-TEMPn(  JCOL) 

TJ-TEHPJ(  JCOL) 

DTEMPR(  JCOL) -DTEMPRt  JCOL) ♦(  -TR)«DXRl+(  TJ)»DXJ1 
DTEMPJ(  JCOL) -DTEMPJ(  JCOL ) +( -TJ ) •DXRl+(  -TR) ‘DXJl 
CONTINUE  


238  C 

239  C CALCULATE  ELEMENT  MATDIM-1 

240  C 


241 

242 

243 


XMUL-RECIP(  MTOWll ) 

TEMPR(  MTDMMl ) -TEMPR(  MTDtttll ) ‘XMUL 
TEHPJ(  MTDr4Ml ) -TEMPJ(  MTDMMl ) ‘XHUL 


244 

245 

246 

247 

248 

249 

2 50 

251 

252 

253 

254 


C 

C COMBINE  LAST  STEP  OF  FIRST  BACK  SUBSTITUTION 

C (CALCULATION  OF  ELEMENT  HATDIM)  WITH  FIRST  STEP  OF  SECOND 

C BACK  SUBSTITUTION  ( CALCULATION  OF  ELEMENT  MATDIM  OF  W WHERE 

C (L*)*W-TEMP  AND  W IS  STORED  IN  TEMP) 

C 


XMUL-RECIP(  t4ATDIM)»*2 
TRl-TEMPn(  MTDMMl ) 

TJ1-TEMPJ(  MTDMMl ) 

TEMPRf  MATDIM)“(  TEMPR(  RATDIM) -TR1*RS + TJ1* JS ) "XHUL 
TEMPJC  RATDIH) -(  TEMPJC  MATDIM) -TJ1»RS-TH1»JS ) ‘XMUL 


255  C 

256  C PERFORM  SECOND  BACK  SUBSTITUTION,  (L*)W-TEMP,  STORING  W IN  TEMP 

257  C 


258  T2"SECOND(X) 

259  CALL  Q8CLOCK( , , IW2 ) 

260  T3-SECOND(X) 


C-18 


A 


275 

C 

276 

C 

CALCULATE  DOT  PDEDUCT  OF  JROW-TH  ROW  1 

277 

c 

WITH  TEMP  STARTING  WITH  THE  JROW+1-3T 

278 

c 

279 

ASSIGN  DXRl.DREELK JROW) 

2S0 

ASSIGN  DXJl  ,DJI?AG1(  JROW) 

281 

ASSIGN  DTR,DTEMPn(  JROW) 

282 

ASSIGN  DTJ,DTEHPJ(  JROVn 

283 

ASSIGN  DDOT,DDTTMP(  JROW) 

284 

DD0T-DXR1*DTR<-DXJ1*DTJ 

285 

DOTR-Q3SSUM(  DDOT) 

286 

DDOT-DXni»DTJ-DXJl*DTR 

287 

DOTJ-Q8SSUM(  DDOT) 

288 

c 

289 

c 

CALCULATE  JROV7-TH  ELEMENT 

290 

c 

291 

XMUL-RECIP(  JROW) 

292 

TEMPR(  JROW)  -(  TEMPR(  JROVJ)  -DOTH)  “XMUL 

293 

TEMPJ(  JROW)  ”(  TEMPJ(  JROVJ)  -DOTJ)  *XMUL 

294 

2 

CONTINUE 

295 

T4-SECOND(  X) 

296 

CALL  Q8CLOCK(  , , IW3 ) 

297 

TBKSBl-( T2-T1)*1000. 

298 

TBKSB2“(  T4-T3)*1000. 

299 

VJBKSBl-(  IVJ2-IW1)/1000. 

300 

VJBKSB2-(  IW3-IV72)/1000. 

301 

RETURN 

302 

END 

303  PROGRAll  SYSTST( TAPES -INPUT, TAPE6 -OUTPUT) 

304  C TEST  COVARIANCE  MATRIX  APPROACH  TO  ADAPTIVE  ARRAY  PROBLEM 


305 

306 

307 

308 

309 

310 

311 

312 

313 

314 

315 

316 


CCriMON  fLATDIM.REEU  20100)  ,J;LAG(  20100)  , EREAL(  200)  ,EtIAG(  200) 

REAL  JIIAG 

COMMON  /BLOWUP/  NEG 

CCMMOW  /CPEXP/  HOLD, NOV/, SAVEXPf  200) 

COMPLEX  SAVEXP 

COIBION  /SETUP/  DIM,EIG(  200) , HMEIG , N3AMP1 ,NSAMP2 ,NSAMP3 , KOISE(  200) 
REAL  NOISE 
INTEGER  DIM 

COMMON  /DEBUG/  IDBG( 20) 

REAL  MXLGL’G,t-ttILGEG 
LOGICAL  INOK 

DATA  MAXDIM/2OO/,MAXEIG/2O0/,MNLGEG/0./ 


317  C INITIALIZE  ROUTINE  TO  CALCULATE  COMPLEX  EXPONENTIALS 


318  NOLD-0 

319  999  CONTINUE 


320 

321 

322 

323 

324 

325 

326 

327 

328 

329 

330 

331 

332 

333 

334 

335 

336 

337 

338 

339 

340 

341 

342 


C INPUT  SYSTEM  PARAMETERS 

C INPUT  DIMENSION  OF  SYSTEM, inniBER  OF  EIGENVALUES ,C0MI40N  LOG  OF 

C MAXIMUM  EIGENVALUE,  LOOPING  PARAMETERS  FOR  SAMPLE  SIZE  OF 

C SAMPLE  COVARIANCE  MATRIX, 

C DEBUG  ARRAY — 

C (VfflERE  THE  DEBUG  FLAG  INDICATES  HOW  MANY  VALUES  OF  AN  ARRAY 

C TO  PRINT,  -1  MEANS  PRINT  THE  ENTIRE  ARRAY,  0 MEANS  PRINT  NOTHING, 

C ^0  MEANS  PRINT  THE  INDICATED  NUMBER  OF  ELEMENTS  OR  ROV/S ) 

C 1 - NUtlBER  OF  EIGENVALUES  TO  PRINT 

C 2 - NUMBER  OF  ELEMENTS  OF  TRUE  COVARIANCE  MATRIX  TO  PRINT 

C 3 - NUMBER  OF  OPTIMUM  WEIGHTS  TO  PRINT 

C 4 - NUMBER  OF  NORMALIZED  OPTIMUM  WEIGHTS  TO  PRINT 

C 5 - CHECK  IMAGINARY  PART  OF  OPTIMUM  SNR  TO  SEE  IF  ZERO 

C 6 - HUMBER  OF  WEIGHTS  FROM  SAMPLES  TO  PRINT 

C 7 - NUmiER  OF  NORMALIZED  WEIGHTS  FROM  SAMPLES  TO  PRINT 

C 8 - CHECK  IMAGINARY  PART  OF  SAMPLE  SNR  TO  SEE  IF  ZERO 

C 9 - NUMBER  OF  ROV/S  OF  S7U1PLE  COVARIANCE  I4ATRIX  TO  PRINT 

C 10  - NUMBER  OF  DYNAMIC  RANGES  OF  COMPONENTS  OF  SAMPLES  TO  PRINT 

C 11  - NUMBER  OP  ROV/S  OF  FACTORED  TRUE  MATRIX  TO  PRINT 

C 12  - NUMBER  OF  ROV/S  OF  FACTORED  SAMPLE  MATRIX  TO  PRINT 

C 13  - NUMBER  OF  ELEMENTS  OF  EACH  SAMPLE  TO  PRINT 

C 14  - NUMBER  OF  COMPONENTS  OF  EACH  VECTOR  VOLTAGE  TO  PRINT 

C 15  - IF  .EQ.  0 STOP  AFTER  UNSUCCESSFUL  DECOMPOSITION,  ELSE  GO  ON 


343 

345 

346 

347 

348 

349 

350 

351 

352 


READ(  5,100)  DIM, NMEIG,HXLGEG,NSAMP1,NSAMP2,NSAMP3, NOISE/  1) , 

* ( IDBG(J) ,0-1,20) 

100  FORMAT/  215, F10. 4, 315, F16. 10/2013) 

IF  /DIM.EQ.0)  STOP  777 
INOK-.TRUE. 

IF  / / DIM.GT.  3)  .AND./ niM.LE.t-LAXDIM)  ) GOTO  1 
WRITE  / 6,101)  DIM,HAXDIM 

101  FORMAT/  linODIMENSIon-,  14, 27!I  IS  OUT  OF  R.ANGE  4 THROUGH  ,14) 
INOK-. FALSE. 

1 CONTINUE 


C-20 


353 

354 

355 

357 

358 

359 

360 

361 

363 

364 

365 


IF  < ( NMEIG .GT .0)  . AND . ( niElG .LE .MAXEIG ) ) GOTO  2 
ViRITE  (6,102)  KMEIG.fUXEIG 

102  FOni!AT(  2 3HnNUItBER  OF  EIGENVALUES- , 14 , 16H  IS  OUT  OF  RANGE, 

• IIH  1 THROUGH  ,14) 

XNOK- .FALSE. 

2 CONTINUE 

IF  ( MXLGEG.GT.I4NLGEG)  GOTO  3 
V)RITE  (6,103)  MXLGEG,I1NLGEG 

103  F0RI1AT(  34II0CO1U1ON  LOG  OF  MAXIMUM  EIGENVALUE- , F10 . 5 , 

• 36H  IS  LESS  THAN  MINIMUM  ALLOWED  VALUE- , F10 . 5 ) 

INOK- . FALSE. 

3 CONTINUE 

IF  ( .NOT. INOK)  STOP  1 


366  C CALCULATE  EIGENVALUES 


3 6 7 

368 

369 

370 

371 

372 


VAL-10. •*(  MXLGEG/NMEIG) 
EIGEN-1 . 

DO  4 J-1,NMEIG 
EIGEN-EIGEN'VAL 
EIG( J) -EIGEN 
4 CONTINUE 


373  C CALCULATE  STEERING  VECTOR 

374  CALL  STEER 


375  C CALCULATE  RECEIVER  NOISE 


376 


CALL  RECNOS 


377  C DISPLAY  INPUTS 


378 
3 80 


387 

338 

389 

390 

391 

392 

393 

394 

395 

396 

397 

398 


WRITE(  6,104)DIM,NMEIG,MXLGEG,NSAMP1,NSAMP2,NSAMP3,(  IDBG(  J) ,J 
*,(  J,J-1,20) 

104  FOPJIATC  14H1SYSTEM  INPUTS  ,/,  lOIIODIMENSION,  15(  2H  .),1X,I4,/, 
»22n0NUMBER  OP  EIGENVALUES , 9(  2H  .),1X,I4,/, 

■«33H0COMMOn  LOG  OF  14AXIMUM  EIGENVALUE  , 4(  2H . ),F10.5,/, 

•20H0INITIAL  SAMPLE  SI2E,10(2H  .),1X,I4,/, 

*1SH0FINAL  SAMPLE  SIZE, IK 2H  .),1X,I4,/, 

•17H0STEP  SAMPLE  SIZE,12(2H.  ) , 14 , / , 

•12n0DEBUG  AnPJVY,14(2H  . ) , IX , 201 3 , / , 4 IX , 2013  ) 

V/niTE  (6,105)  ( EREAL(  J)  , J-1  ,DIM) 

105  F0RI4AT(  2 7H0STEERING  VnCTOR(  REAL  PART ) , / , ( IX , 10E12 . 5 ) ) 

WRITE  (6,106)  ( EMAG(  J) , J = 1 ,DIM) 

106  FORMATC  2 7II0GTEERING  VECTOn(  IMAG  PART ) , / , ( IX , 10E12 . 5 ) ) 

WRITE  (6,107)  { HOISE(  .7)  , J-1  ,DIM) 

107  FORMAT(  13H0NOISE  VECTOR , / , ( IX , 10E12 . 5 ) ) 

IF  ( IDEG(  1)  .EQ.0)  GOTO  5 

LENP-IDBG( 1) 

IF  ( LENP.EO.-l)  LENP-NMEIG 
VJRITE  (6,108)  ( EIG(  J)  , J-1  ,LENP) 

103  rORHAT(  12IiOEIGENVALUES  , / ,(  1X,10E12.5)) 

5 CONTINUE 


399  C SOLVE  PROBLEM  WITH  TRUE  MATRIX,  GET  BEST  SNR 


1,20) 


C-21 


CALL  TSOLVE 


400 

401 

402 

403 

404 

405 

406 

407 


C IF  DECOMPOSITION  OF  THUS  HATSIX  FAILED,  GET  NEXT  SET  OF  INPUTS 

IF  (NEG.EQ.l)  GOTO  999 

C GENERATE  SAMPLE  MATRICES  AND  SOLVE  FOR  SNR 

IP  ( NSAIIP1.LE.NSAMP2)  CALL  SSOLVE 

C GO  BACK  FOR  MORE  DATA 

GOTO  999 
END 


C-22 


408 

SUBROUTINE  STEER 

409 

C 

CALCULATE  STEERING  VECTOR 

410 

COmON  IIATDIM.REELC  20100)  ,JMAG(  20100)  , EREAL(  200)  ,EtLAG(  200) 

411 

REAL  JMAG 

412 

COMMON  /SETUP/  DIM,EIG(  200) , NMEIG , KSAMPl ,NSAMP2 , NSAMP3 , NOISE(  200) 

413 

REAL  NOISE 

414 

INTEGER  DIM 

41S 

EREAL(  IjDIM)-!. 

416 

EMAGC 1>DIM)>1. 

417 

RETURN 

418 

END 

U V 


SUBROUTINE  RECNOS 


419 


420  C 


CALCULATE  RECEIVER  NOISE 


421 

422 

423 


COMMON  /SETUP/  D1M,EIG(  200) .NMEIG.NSAMPl ,NSAMP2 ,NSAMP3 ,NOISE(  200) 
REAL  NOISE 
INTEGER  DIM 


CHANGED  BY  RITCHEY  10/10/77 

DATA  NOISE/200*!. E-6/ 


426 

427 

428 


NOISE(  2>DIM-1)  - NOISE(l) 

RETURN 

END 


C-24 


429 


SUBHOUTIHE  TSOLVE 


130  C SOLVE  SYSTEM  USING  TRUE  COVARIANCE  MATRIX  FOR  OPTIMUM  SNR 


431 

432 

433 

434 

435 

436 

437 

438 


442 

443 

444 

445 

446 

447 

448 

4 50 
451 


COMMON  /V7CTIME/  WSETUP,17DECMP,raKSBl  ,W3K3B2  .WCVCMP 
COMMON  /BLOVnjP/  HEG 

COmON  /SETUP/  DIM,EIG(  200) ,NMEIO,NSAMPl,NSAMP2,HSAMP3,NOISE(  200) 
REAL  NOISE 
INTEGER  niM 

COMMON  HATD1M,REEL( 20100) ,JMAG( 20100) ,EREAL<  200) ,EMAG<  200) 

REAL  JMAG 

COMMON  /IVORK/  MATOLD ,MTDMM1 ,MTDMM2 ,MTDMP1 ,MTDMP2 , S IZEMl , 

• DEREAL,DEMAG,RECIP(  200) ,DnEEL(  200) ,DREEL1(  200) .DJMAGC  200) , 

• DJMAGK  200) ,TEMPR(  200) ,TEMPJ(  200) ,DTEMPR(  200) ,DTEMPJ(  200) , 

• DOTTMPt 200) ,DDTTMP(  200) ,R3,J3 
REAL  JS 

INTEGER  3IZEM1 

DESCRIPTOR  DREELl , DJMAGl , DTEMPR , DTEMPJ , DDTTMP , DEREAL , DEMAG 
DESCRIPTOR  DREEL.DJMAG 

COMMON  /TIME/  TSETUP , TDECMP , TBKSBl , TBKSB2 , ICVCMP 
COMMON  /DEBUG/  IDBG(  20) 

COMMON  /TEMP/  TDOT(  200) , SMPR(  200) , SMPJ(  200) , DTDOT , SRDOT , SIDOT , 
•WHDOT , WIDOT , ITNRMR , TOIPJII , SNRO 
DESCRIPTOR  DTDOT , SRDOT , SIDOT ,WRDOT , WIDOT ,WNRMH ,WNRMI 
COMMON  /SAVEW/  WRN(  200)  .VIINC  200) 


452  C CALCULATE  TRUE  COVARIANCE  MATRIX 


453  CALL  BUIDTM 

454  C SOLVE  SYSTEM 

455  C SET  UP  DESCRIPTORS 


456 

457 

458 

459 

460 


MATDIM-DIM 
MATOLD-DIM+1 
CALL  CHOLDC 

IP  ( IDBG(  11) .NE.0)  WRITE  (6,108) 

108  FORMAK  27H0DESCRIPTOH  SET  UP  COMPLETE) 


461  C PERFORM  LL*  DECOMPOSITION 


462 


CALL  CIIOLDC 


463  C IF  DECOMPOSITION  FAILED,  PRINT  OUT  MATRIX  AND  RETURN  FOR  NEXT 

464  C SET  OP  SYSTEM  INPUTS 


465 

466 

467 

468 

469 

470 

471 

472 

473 

474 


IP  (NEG.EQ.l)  IDBG(11)--1 
IF  ( IDBG( 11) .EQ.0)  GOTO  6 
IDEBUG-IDBGC 11) 

IF  ( IDEBUG.FQ. -1)  IDEBUG-DIM 
WRITE  ( 6 , 109 ) 

109  FOR^LAT( 32H0FACTORED  TRUE  MATRIXC REAL  PART)) 
CALL  DISP( DIM, REEL, IDEBUG) 

VJRITE  (6,110) 

110  FOnMAT(  :2II0FACTOnrjD  TRUE  MATR1X(  IMAG  PART)) 
CALL  DISP(  DIM, JMAG, IDEBUG) 


C-25 


475 

WRITE  (6,112)  < RECIP(K) ,K-1,IDEBUG) 

476 

112 

FOniElTC 2dnOTRUE  RECIPROCAL  DIAGONALS ,/,( IX , 10E12 . 5 ) ) 

477 

6 

CONTINUE 

478 

IF  (NEG.EQ.0)  GOTO  10 

479 

WRITE  ( 6,111) 

4 80 

111 

FORIIATC 1H0,10( IH*) , 35HDECOMPOSITION  OF  TRUE  MATRIX  FAILED) 

481 

RETURN 

482 

10 

CONTINUE 

483 

C 

CONJUGATE  STEERING  VECTOR 

484 

Et4AG(  ltDIM)--Et4AG(  ItDIM) 

485 

C 

PERFORM  BACK  SUBSTITUTIONS 

486 

CALL  BAKSUB 

487 

IF  ( IDBGC 3) .EQ.0)  GOTO  1 

488 

LENP-IDBG(  3) 

489 

IF  (LENP.EQ.-l)  LENP-MATDIM 

490 

WRITE  (6,100)  ( TEMPn(  J) ,J-1,LENP) 

491 

100 

FORMAT(  27II0OPTIMUM  WEIGUTS(  REAL  PART) ,/,( 1X,10E12.5) ) 

492 

WRITE  (6,101)  ( TEMPJ(  J) ,J-1,LENP) 

493 

101 

FORMAT(  2 7H0OPTIMUM  WEIGHT3(  IMAG  PART ) , / , ( IX , 10E12 . 5 ) ) 

494 

1 

CONTINUE 

495 

c 

NORMALIZE  WEIGHTS  SO  THEIR  ONE-NORM  IS  ONE  AND  SAVE  THEM 

496 

SDM-0 

497 

DO  2 J-1,DIM 

498 

SUM-SUM+SQRT(  TEMPR(  J ) • *2 ♦TEMPJ(  J ) • • 2 ) 

499 

2 

CONTINUE 

500 

XMUL-1 . /SUM 

501 

WRN(  ljDIM)-TEMPR(  1|DIM)*XMUL 

502 

WIN(  1(DIM)-TEMPJ(  1)DIM)*XMUL 

503 

IF  ( IDBG( 4 ) .EO.0)  GOTO  4 

504 

WRITE  ( 6 , 102 ) SUM 

505 

102 

F0PJ1AT(  2 9n0ONE-NOR14  OF  OPTIMUM  WEIGHTS"  ,E12 . 5 ) 

506 

LENP-IDBG(  4) 

507 

IP  (LEHP.EQ.-l)  LENP-MATDIM 

508 

VmiTE  (6,103)  (V7RN(  J)  ,J-1,LENP) 

509 

103 

FORILAT(  3 8n0NOr^lALIZED  OPTIMUM  V7E1GHTS(  REAL  PART)  , / , ( IX , 10E12 . 5 ) ) 

510 

WRITE  (6,104)  ( WIH(  J) ,J"1,LENP) 

511 

104 

FORMAT(  38n0NORMALIZED  OPTIMUM  WEIGHTS{ IMAG  PART ) , / , ( IX , 10E12 . 5 ) ) 

512 

4 

CONTINUE 

513 

C 

CALCULATE  MAXIMUM  POSSIBLE  TIMES 

514 

TIMtLAX-TSETUP+TDnCMP+TDKSDl+TBKSB2 

515 

WCTMAX  -V7S  ETUP  +WDECMP +V7DKSn  1 +WI5KSB2 

516 

C 

CALCUALTE  OPTIMUM  SNR 

517 

ASSIGN  DTDOT,TDOT(  l(DIM) 

518 

ASSIGN  C.’iTnMR  , SMPn(  1;DIM) 

519 

ASSIGN  VTlinill , SMPJ(  1;D1M) 

520 

ASSIGN  SnDOT,EREAL( 1;DIM) 

C-26 


521 

522 

523 

524 

525 

526 

527 


539 

541 

543 

544 
54  5 

546 

547 

548 

549 
5 50 


105 


107 


106 

5 


ASSIGll  SIDOT,ErnG(  1:DIM) 
AssiGii  v;nDOT,Tr:npn(  IjDim) 
ASSIGII  V7ID0T,?n:'PJ(  l;DIf!) 


DTD0T-r)nD0T*V:nD0T  + SID0T*V7ID0T 
EUnO-OOSSUIK  DTDOT) 

WRITE  ( 6,105) 

FORIIATt  56110  NUMBER  SNR  D3  D0V7N 


1 ,6011  TIME  FOR  TIME  PER 

2 lOH  LOCATION,/, 

3 56H  SAMPLES 

4 60H  BACKCUB  2 SAMPLE  TO 

5 10H  OP,/, 

6 26X,4(10H  MILLISEC) ,30H 

7 10H  SUP-NORM,/, 

8 26X,4( lOH  ( CPU) ) ,30H 

9 2 6X,  4(  1011  (V7ALL)),30H 


TIME  TO 


TIME  FOR  TIME  FOR  TIME  FOR 
MAXIMUM  ONE-HORM  SUP-N0PJ4, 


SET  UP  DECOMP  BACKSUB  1, 

MOVE  POSSIBLE  OF  ERROR  OF  ERROR, 


UPDATE 

MATRIX 

MILLISEC 


MATRIX 

MILLISEC 
( CPU) 


TIME,20X, 

MILLISEC,/, 
(CPU) ,/, 


• 66X,30II  (CPU)  (V7ALL)  (WALL),/, 

1 6 6X,10H  (V7ALL)) 

WRITE  (6,107)  SNR0,TSETUP,TDnCMP,TBKSBl,TBKSB2 ,TIMMAX, 
» WSETUP  ,V7DECMP  , WDKSBl  ,V7BKSB2  ,WCTMAX 
FORMAT( 8H0OPTIMUM, Elfl . 3 , 8X , 4F10 . 3 , 20X , P10 . 3 , / , 

• 26X,4F10.3,2OX,F10.3) 

IP  ( IDBG( 5) .EQ.0)  GOTO  5 

DTD0T-SRD0Tn7ID0T-SID0T*WHD0T 
TSKR-Q8SSUM(  DTDOT) 

WHITE  ( 6 , 106 ) TSNR 

FOPJIAK  26II0IMAG  PART  OF  OPTIMUM  SNR-,E12.5) 

CONTINUE 

RETURN 

END 
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551 

SUBROUTIHE  DOEXP 

552 

C 

PRECALCULATE  COMPLEX  EXPONENTIALS 

553 

c 

( ROVy-TH  ROOTS  OP  UNITY) 

554 

COMMON  /CPEXP/  NOLD,NOt7,SAVEXP(  200) 

555 

COMPLEX  SAVEXP 

556 

COMPLEX  MUL.PP.OD 

557 

DATA  PI/3.14159265358979323/ 

558 

NOl.D-NOVJ 

559 

MUL“CEXP(  CMPLX(  0.0,2 .0»PI/FLOAT(  NOW)  : 

560 

PROD-CMPLXt 1.0, 0.0) 

561 

DO  1 J-l,NOW 

562 

SAVEXP(  J)-PROD 

563 

PROD-PROD»MUL 

564 

1 

CONTINUE 

565 

RETURN 

566 

END 
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9.  . 


567 


SUBROUTINE  BUIDTM 


568 

C 

FORM  TRUE  COVARIANCE  hLITRlX 

569 

COMMON  /CPEXP/  MOLD, nOV7,SAVEXP(  200) 

570 

COMPLEX  SAVEXP 

571 

COMMON  /SETUP/  DIM,EIG(  200) , NMEIG , NSAMPl , NSAMP2 , NS7VKP3 , NOISE(  200) 

572 

REAL  NOISE 

573 

INTEGER  DIM 

574 

COmON  MATDIM,REEL{  20100)  ,JMAG(  20100)  , EREALC  200)  , EMAG(  200) 

575 

REAL  JMAG 

576 

COMMON  /DEBUG/  IDBG( 20) 

577 

COMMON  /TRUE/  ER( 200) ,EC{  200) 

578 

COMPLEX  S 

INTEGER  DIM141,DIMP1 

580 

C 

PRECALCULATE  COMPLEX  EXPONENTIALS 

581 

NOW-DIM 

582 

IF  ( NOV;.NE.NOLD)  CALL  DOEXP 

583 

c 

COMPUTE  EXP  FACTORS  OF  LOWER  HALF  OF  TOEPLETZ  COVARIANCE  MATRIX 

584 

SUM«0.0 

585 

DO  100  K"1,NMEIG 

586 

SUM-SUM+EIG(  K) 

587 

100 

CONTINUE 

588 

ER{ 1)-SUM 

589 

EC(  l)-0.0 

590 

DIMMl-DIM-1 

591 

DO  200  I-1,DIMM1 

592 

ISUB-I+1 

593 

S“CMPLX(  0.0, 0.0) 

594 

DO  210  K“1,NMEIG 

595 

S-S+EIG( K)*SAVEXP(  ISUB) 

596 

ISUB-ISUB+I 

597 

IF  ( ISUB. GT. DIM)  ISUB-ISUB-DIM 

598 

210 

CONTINUE 

599 

ER(  I + 1)-REAL(  S) 

600 

EC{  1 + 1)--AIHAG{  S) 

601 

200 

CONTINUE 

602 

IF  ( IDBG( 2) .EQ.0)  GOTO  500 

603 

LENP-IDBG(  2) 

604 

IP  <LENP.EQ.-1)  LENP“DItl 

605 

WRITE  ( 6,808)  ( ER(  I) ,I”1,LENP) 

606 

808 

FOPMATC 2 3H0TnUE  RTTRIXf REAL  PART) , / , ( IX , 10B12 . 5 ) > 

607 

V7RITE  ( 6,009)  ( EC(  I ) , I-l  ,LENP) 

608 

809 

FOnMAT(  2 3II0TRUE  MATRIX(  IlLAG  P.ART)  , / ,(  IX,  10E12 . 5)  ) 

609 

500 

CONTINUE 

610 

C 

FORM  COVARIANCE  MATRIX 

611 

DIMPl-DIM+1 

612 

INX“0 

613 

DO  300  I-l, DIM 

614 

LOOP-DIMPl-I 
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615 

DO  310  J“l,LOOP 

616 

IllX-IllX+1 

617 

nEEI,(  IIJX)  "Em  0) 

618 

Jl’JlGC  II1X)-EC(  J) 

619 

310 

CONTIUHE 

620 

300 

COMTIHUB 

621 

C 

ADD  NOISE  TO  DIAGONAL 

622 

J-1 

623 

DO  400  I-1,DIM 

624 

REEL(  J ) -nEEL(  J ) +NOISE{ 

625 

J-J+DIMPl-I 

626 

400 

CONTINUE 

627 

RETURN 

628 

END 
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629  SUBROUTINE  S50LVE 

630  C SOLVE  SYSTEM  USING  SAMPLE  COVARIANCE  MATRIX 


631 

632 

633 

634 

635 

636 

637 

638 

639 

640 


644 

645 

646 

647 

648 

649 
6 50 

652 

653 

654 

655 

656 

657 

658 

659 

660 
661 
662 


COM_MON  /SMPULP/  IDBG14  ,NUM3AM 

COMMON  /V;CTIMB/  WSETUP  .VJDECMP  ,Vn3KSBl  ,VraKSB2  .WCVCMP 
COMiMON  MATDIM,REEL(  20100)  ,JtLAG(  20100 ) , EP.EAL(  200)  , EMAG(  200) 

REAL  JMAG 

COMMON  /CLOVJUP/  NEG 

COIUION  /COV/  COVR(  20100) ,COVJ(  20100) 

COMMON  /SETUP/  DIM,EIG(  200 ) , NMEIG , N3AMP1 , NSAMP2 ,NSAMP3 , NOISEt 200) 
REAL  NOISE 
INTEGER  DIM 

COMMON  /VroRK/  MATOLD , MTDMMl ,MTDMI42 , MTDMPl ,MTDMP2 , SIZEMl , 

• DEREAL, DEMAG,  RECIP(  200)  ,DnEEL(  200)  ,DP.EEL1(  200)  ,DJt!AG(  200)  , 

• DJMAGK  200) ,TEMPR(  200) ,TEMPJ( 200) .DIEMPRt  200) ,DTEMPJ(  200) , 

• DOTTMPC  200) ,DDTTMP(  200) ,RS,JS 
REAL  JS 

INTEGER  SIZEMl 

DESCRIPTOR  DREELl .DJMAGl ,DTEMPR,DTEMPJ,DDTTMP, DEREAL, DEMAG 
DESCRIPTOR  DREEL,DJMAG 

COMMON  /TIME/  TSETUP , TDECMP , TBKSBl , TBKSB2 , TCVCMP 
COMMON  /TRUE/  ER(  200) , EC(  200) 

COMMON  /TEMP/  TDOT(  200) ,SMPR(  200) ,SMPJ( 200) ,DTDOT , SRDOT , S IDOT , 
•V7RD0T  , WIDOT  , VJNRMH , WHRMI , SNRO 
DESCRIPTOR  DTDOT  , SRDOT  , SIDOT  , WRDOT  , WIDOT  , WNRMR , WNRMI 
C06U10W  /DEBUG/  IDBG(  20) 

COMMON  /SAVEV;/  V;RN(  200)  ,V/IN(  200) 

DIMENSION  SMIN(  200) , SMAX(  200) 

INTEGER  WT1,WT2 

COMPLEX  C RANG, C RANG I, RAN 

INTEGER  SAMTOT 

DESCRIPTOR  DCOVR,DCOVJ 

DESCRIPTOR  DDR.DDJ 

DATA  PI/3.141.59265358979323/ 

DATA  INTRND/0/ 


663  C INITIALIZE  RANDOM  NUMBER  GENERATOR 


664 

665 

666 
667 


IF  ( INTRND.EQ. 1)  GOTO  500 
INTRND"! 

RAH-CRAKGK  0.0) 

500  CONTINUE 


668  C INITIALIZE  S71MPLE  GENERATING  ROUTINE 

669  CALL  SAMP I 


C70  C ZERO  OUT  COVARIANCE  MATRIX 


671 

672 

673 

674 

675 

676 


LEN“DIM’*(DIM+l)/2 
ASSIGN  DDH,REEL(  1;LEN) 
ASSIGN  DDJ  ,,THAG(  1 jLEN) 
ASSIGN  DCOVn,COVR(  1;LEN) 
ASSIGN  DCOVJ,COVJ(  IjLEN) 
DCOVR-0.0 
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677 

DCOVJ-0.0 

678 

IDBGlil-ID3G(  14) 

679 

IF  ( IDDG14 .EQ. -1)  IDCG14-NMEIG 

680 

IDEBUG-IDBG(  10) 

681 

IF  ( IDEDUG.EQ.-l)  IDEBUG-DIM 

632 

TUPDAT-0.0 

633 

V7UPDAT-0.0 

684 

EAMTOT-0 

685 

KDEBUG-IDBG(  13) 

686 

IF  ( KDEBUG.EQ.-l)  KBEBUG-DIM 

687 

C 

LOOP  THROUGH  SAMPLE  SIZES 

688 

DO  1 NSAMP-NSAMP1,NSAMP2,N3AMP3 

689 

NEED-NSAl-lP-SAMTOT 

690 

C 

LOOP  THROUGH  NUMBER  OF  EXTRA  SA14PLES  NEEDED 

691 

TTCOV-0. 

692 

WTCOV-0.0 

693 

IF  ( IDEBUG.EQ.0)  GOTO  13 

694 

SMIN( l!lDEBUG)-l.E30 

695 

SMAX(  1;IDEBUG)--1.E30 

696 

13 

CONTINUE 

697 

LOPMIN-SAMTOT+1 

698 

DO  3 J-LOPMIN,NSAMP 

699 

C 

GET  SAMPLE 

700 

NUMSAM-J 

701 

CALL  SAMP 

702 

IF  ( IDEBUG.nQ.0)  GOTO  122 

703 

DO  123  K-1,IDEBUG 

704 

SADS-SQR'K  SHPR(  K)*»2+SMPJ(  I<)**2) 

705 

SMIIK  K ) “AMINK  SMIIK  K ) , SABS  ) 

706 

SrjlX(  K)  "AtLAXK  SMAX(  K)  , SABS  ) 

707 

123 

CONTINUE 

708 

122 

CONTINUE 

709 

IF  ( KDEBUG.EQ.0)  GOTO  44 

710 

VmiTB  (6,441)  J,C SMPRC JJ) ,JJ=1,KDEBUG) 

711 

441 

FORMAT(  14II0r.AMPLE  NUMBER , 14 , 12H  (REAL  PART)  , / , ( IX  , 10E12 

712 

V.’RITE  ( 6,442  ) J , ( SMPJ(  JJ  ) , JJ-1  , KDEBUG  ) 

713 

442 

FOni.LAT(  14H0SAMPLE  NUMBER , 1 4 , 12H  ( IMAG  PART ) , / , ( IX , 10E12 

714 

44 

CONTINUE 

715 

C 

ADD  TO  COVARIANCE  MATRIX 

716 

CALL  COVCMP 

717 

TTCOV-TTCOV+TCVCMP 

718 

WTCOV-WTCOV+WCVCMP 

719 

3 

CONTINUE 

720 

IF  ( IDEBUG.EO.U)  GOTO  14 

721 

VJRITE  (6,110)  NEED,  ( ICS  ,S:iI!:{  KS  ) , SMAX(  XS  ) ,:CS”1 , IDEBUG  ) 

722 

110 

FORMAT(  3 2IIODYNAMIC  RAIIGE  OF  COMPOUHlirS  FOR, 14,811  SAMPLES, 

( / , .ix,i3  .ix.nio.-i  ,ix,Ei0.4  , i:;,  13 , IX,  '.31'/.  1 , i::,nio.'! , ix,i3  ,ix, 
E10 . 4 , IX , ElO  . 4 , IX , 1 3 , IX  , 131 7 . 4 , IX  , 3.13 . 4 , IX  , 1 3 , IX  , E10 . 4 , IX , 
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726 

14 

727 

C 

728 

729 

730 

731 

732 

733 

734 

735 

736 

737 

738 

739 

108 

741 

742 

743 

744 

745 

109 

747 

748 

749 

113 

751 

12 

752 

C 

753 

C 

754 

755 

C 

756 

757 

758 

759 

45 

760 

761 

762 

763 

764 

111 

765 

766 

767 

112 

768 

769 

20 

770 

771 

772 

773 

47 

774 

£10.4) ) 

CONTINUE 

MOVE  DATA  FROM  (COVR.COVJ)  TO  (REEL.JMAG) 

CALL  OOCLOCKC  ,,WT1> 

TT1-SEC0ND(  X) 

DDR-DCOVR 
DDJ-DCOVJ 
TT2-SECOND(  X) 

CALL  QOCLOCK(  ,,WT2) 

TMOVE-1OO0.*<  TT2-TT1) 

W!0VE"(  V7T2-V;T1  ) /1000. 

SAMTOT-NSAMP 

IF  ( IDDG(  9)  .EQ.0)  GOTO  12 
WRITE  (6,108)  NSAMP 

FORMAT(  13H0SAMPLE  SIZE- , 14 , 5X , 24HSAMPLE  COVARIANCE  MATRIX, 
11H(REAL  PART)) 

LENP-IDBG(  9 ) 

IF  (LENP.EO.-l)  LENP-DIM 
CALL  DISP(  MATDIM,REEL,LENP) 

WRITE  (6,109)  NSAMP 

FORMAT(  13n0DAI4PLE  SIZE- , 14 , 5X,  24HSAMPLE  COVARIANCE  M-ATRIX, 
llIK  IILAG  PART)) 

CALL  DISPdIATDIM, JMAG,LENP) 

WRITE  (6,113)  NSAMP, ( RECIP(  K) ,K-1,LENP) 

FORMAT( 11H0FOR  NSAMP- , 1 5 , 2 IH  RECIPROCAL  DIAGONALS ,/,( IX , 10E12 
.5)  ) 

CONTINUE 

SOLVE  SYSTEM  WITH  SAMPLE  COVARIANCE  MATRIX 
PERFOPJ4  LL*  DECOMPOSITION 

CALL  CIIOLDC 

IF  DECOMPOSITION  FAILED,  PRINT  OUT  14ATRIX  AND  GET  MORE  SAMPLES 

IP  (NEG.EQ.0)  GOTO  45 
IDTMP“IDBG(  12 ) 

IDBG( 12 )--l 
CONTINUE 

IF  ( IDBG( 12) .EQ.0)  GOTO  20 
IDEBUG-IDBG(  12) 

IP  ( IDEBUG.EQ.-l)  IDEBUG-DIM 
WRITE  (6,111)  NSAMP 

FORMAT( 45HaFACTOnED  SAMPLE  MATRIX( REAL  PART)  FOR  NSAMP-, 15) 
CALL  DISP(  DIM, REEL, IDEBUG) 

V/RITS  (6,112)  NSAMP 

FOnMAT(  4 5H0PACTORED  SAMPLE  ILATRi:<(  IMAG  PART)  FOR  NSf.MP’,I5) 
CALL  DISP( DIM, JMAG, IDEBUG) 

CONTINUE 

IF  (NEG.EQ.0)  GOTO  46 
IDDG( 12 ) -IDTMP 
V7RITE  (6,47)  NSAMP 

POR!'..AT(  1H0,I10,5(  111*)  ,20nDECOMPOSITION  FAILED, 20(  IH*)) 

IF  ( IDBG( 15) .EQ.0)  RETURN 
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115 

GOTO  1 

776 

46 

CONTIHUE 

111 

C 

PERFOnt-l  BACK  SUBSTITUTIONS 

77S 

CALL  BAKSUB 

779 

IF  ( IDDG( 6 ) .EQ.0)  GOTO  7 

780 

LENP"IDBG( 6 ) 

731 

IF  (LENP.EQ.-l)  LENP-DIM 

782 

WRITE  (6,100)  KSAMP,(  TEMPRC J) ,J-1,LESP) 

783 

100 

FORIIATI  3 6H0V7EIG1ITS(  REAL  PART)  FOR  SAMPLE  SIZE-, 110, 

* 

/,(  1X,10E12.5)) 

785 

WRITE  (6,101)  NSAMP,(  TEMPJ(  J) , J-1,LENP) 

786 

101 

F0R11AT(  3 6H0WEIGnTS(  IMAG  PART)  FOR  SAllPLE  SIZE-, 110, 

* 

/,(  1X,10E12.5)  ) 

788 

7 

CONTINUE 

789 

C 

CALCULATE  SNR,  USE  SMPR  AND  SMP0  FOR  TEMPORARY  STORAGE 

loss 

DTDOT-WRDOT»SEDOT+V7IDOT*SIDOT 

791 

XNUM-(  Q8SSUH(  DTDOT)  )»»2 

792 

DTDOT“-WRDOT*SIDOT+WIDOT»SRDOT 

793 

XNUM-XtroM+(  Q8SSUM(  DTDOT) ) *»2 

794 

DO  40  JSNH-1,DIM 

795 

TSNRR-0. 

796 

TSNRI-0. 

797 

JSNRPl-JSNR+1 

798 

JSNRMl-JSNR-1 

799 

IF  (JSHR.EQ.l)  GOTO  41 

800 

DO  42  KSNR“1,JSNRM1 

801 

TSNRR-TGNRE  + TEMPRC  KSNR) »ER(  JSHRPl-KSNR) - 

» 

TEMPJ(  KSNR)*EC(  JSNRPl-KSNR) 

803 

TSNRI-TSNRI  + TEMPR(  KSNR)*EC( JSNHPl-KSNH) ♦ 

ft 

TEMPJ(  KSNR) •ER(  JSNRPl-KSNR) 

805 

42 

CONTINUE 

806 

41 

CONTINUE 

essi 

TSNRR“TSnRR+TEMPR(  JGNR)»(  ER(  l)+NOISE(  J3NR) ) 

308 

TSNni  = TSNRI  + TEMPJ( JSNR) *(  ER(  1 ) +nOISE( JSNR) ) 

809 

IF  { JSNR. EQ. DIM)  GOTO  48 

810 

DO  43  KSNR- JSNRPl , DIM 

811 

TriNRR-TSHRR+TEMPR(  KSNR)  *ER(  KSNR-JSNRMl ) + 

ft 

TE:!PJ(  KSNR)  *EC(  KSNR-JSNRMl) 

813 

Tr)NRI-TSNRI-TEMl>R(  KSNH)*EC(  KSNR- JSNRMl ) + 

ft 

TEMPJ( KSNR) *ER( KSNR-JSNRMl ) 

815 

43 

CONTINUE 

816 

48 

CONTINUE 

817 

SMPR(  JSNR) -TSNRR 

818 

SMPJ(  JSNR) -TGNRI 

819 

40 

CONTINUE 

820 

DTDOT“WRDOT*V,’NRMR+WIDOT*V,’NRMI 

821 

XDENR-onsSUlK  DTDOT) 

822 

DTDOT  “WnDOT*V;HnMI-WIDOT*Vn7RMR 

823 

XDENI-QS.S.SUM(  DTDOT) 

824 

XDEN-XDENR*XDENR+XDENI*XDENI 

825 

SNR-XNUH'-XDENR/XDEN 

826 

IF  ( IDnG(  8) .EQ.0)  GOTO  10 
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S37 

sun I - -XUUM’XDEUI/XDnU 

823 

WniTU  ( 6 , 105  ) HSAI-IP  , sum 

829 

105 

FOPJ!AT(  17H0FOn  SAIIPLE  SIZE- , 110, 18H  IMaC  PART  OP  SNR-,E12.5) 

830 

10 

CONTINUE 

831 

C 

NORMALIZE  VJEIGHTS,  STORE  IN  SMPR.SMPJ 

832 

SUM-0.0 

033 

DO  8 J-1,DIM 

834 

SUM"3U1I*SQRT{  TEMPRC  J)  **2  + TEflPJ(  J)  **2  ) 

835 

8 

CONTINUE 

836 

XMUL-1 . /SUM 

837 

\'/NR34R-VmDOT»XMUL 

838 

WNRMI  -VJIDOT'XMUL 

839 

IF  ( IDDG( 7) .EQ.0)  GOTO  9 

840 

LENP“IDBG(  7 ) 

841 

IP  (LDNP.EQ.-l)  LENP-DIM 

842 

WRITE  (6,102)  NSAMP.SUM 

843 

102 

FORMATC 13H0SAI4PLE  SIZE- , 110, 5X, 20HOHE-NORM  OP  WEIGHTS- , E12 . 5 ) 

844 

V/RITE  (6,103)  ( SMPR(  J)  ,J-1,LENP) 

845 

103 

FORMAT(  30H0NORtlALIZED  V?EIGIITS(  REAL  PART ) , / , ( IX , 10E12 . 5 ) ) 

846 

VmiTE(  6 , 104 ) ( SMPJ( J) , J-1 ,LENP) 

847 

104 

FORMAT(  30H0NOnMALIZED  WEIGHTS(  IMAG  PART) , / ,(  IX, 10E12 . 5) ) 

848 

9 

CONTINUE 

849 

C 

PERFORM  OTHER  ERROR  ANALYSIS 

850 

C 

CALCULATE  ONE-HOPJl  OP  ERROn( ABSOLUTE-RELATIVE) 

051 

EMAX--1.E30 

852 

LI4AX-1 

853 

SUM-0.0 

854 

DO  11  J-1, DIM 

855 

DIFR-vmm  J)-SMPR(  J) 

856 

DIFI-UIIK  J)-SMPJ(  J) 

857 

ERO-SOnT(  DIFR*DIFR+D1PI»DIP1) 

858 

SUH-SUM+ERQ 

859 

IP  ( ERO.LT.EILAX)  GOTO  11 

860 

EI4AX-ERQ 

861 

LMAX-J 

862 

11 

CONTINUE 

863 

TUPDAT-TUPDAT+TTCOV 

864 

VH7PDAT  "VJUPDAT  +(';TC0V 

865 

TTCOV-TTCOV/NEED 

866 

VJTCOV-WTCOV/NEED 

867 

T IMIIVX  - TUPDAT  + TI  !OVE +TS  ETUP  <•  TDECMP  tTBKSB  1 + TBKSB2 

860 

V;CTIlAX"V/iJPDAT+V,'MOVEtV7SETU?+VroECMP+W3KSBl+','J3KSB2 

869 

DBDOV.’U“i0.  •ALOG1.0(  StiRO/SUR) 

370 

WRITE  ( 6 , 107  ) NSAMP  , SNR , DnOO'TN  , TDECMP  , TSK3B1  , TRI<SB2  , TTCOV , TMOVE 
• TIMM.AX  , GUM,  Eia.X  , , (TOECMP , Vr.KSJZ  1 , (■.’3X352  , WTCOV , W'iOVE , V.’CTillX 

072 

107 

FOniLA'K  1110,17  ,E10.3,FO.3,10X,api0.3,I10,/,36X,6F10.3) 

37  3 

1 

CONTINUE 

374 

RETURN 

375 

END 
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876 


SUBHOUTINE  SAMP 


877  C GCMEPATE  A EAMDOM  VECTOP  VmOSS  COMPONENTS  HAVE  THE  COVARIANCE 

87  8 C t-UTniX  DEFINED  IN  CUIDTM 


879 

880 
881 
882 

883 

884 

885 

886 

838 

889 

890 

891 


COMt!ON  /SflPlIEP/  IDEGH  .HIRISAM 
COMMON  /CPEXP/  NOLD,NOV7,SAVEXPt  200) 

COMPLEX  SAVEXP 

COMMON  /SETUP/  DIM,EIG(  200) ,NMEIG .NSAMPl ,KSAMP2 ,NSAMP3 ,KOISE(  200) 
REAL  NOISE 
INTEGER  DIM 

COMMON  /DEBUG/  IDBG( 20) 

COMMON  /TEMP/  TDOT(  200) ,GriPR(  200) , SMPJ(  200) , DTDOT , SRDOT , SIDOT , 
•WP.DOT  ,V;iDOT  MTllRMR  .VnJRMI , SNRO 
DESCRIPTOR  DTDOT  .SRDOT , S IDOT  , VJRDOT , V/IDOT  , WNRMR  .WNRMI 
DIMENSION  SQEIG(  200) , RD(  200) ,SQN03(  200) 

COMPLEX  RD.SUM 
COMPLEX  CRANG 


89  2 C CALCULATE  RANDOM  VOLTAGES  POP.  EACH  EIGENVALUE 


893 

894 

895 

896 

898 


DO  1 K-l.NMEIO 
RD(  K)-CnANG(  0.0) 

IF  (K.I.E.IDBG14)  WRITE  (6,100)  NUMSAM,  X , RD(  K ) 

100  FORMAT(  15H  SAMPLE  NUMBER- , 1 5 , 5X , 15HV0LTAGB  NUMBER- , 15 , 5X , 

* 8IIVOLTAOE-,E15.8,2X,E15.8) 

1 CONTINUE 


899  C COMPUTE  SAMPLE 


900  DO  2 I-1,0IM 

901  C GET  RANDOM  STARTING  POINT  TO  SIMULATE  RECEIVER  NOISE 


902 

903 

904 

905 

906 

907 

908 

909 

910 

911 

912 

913 

914 


SUM-SONOS( I)*CRANG{ 0.0) 

ISUB-I+1 

IF  ( ISUB.GT.DIM)  lSUB-1 
DO  3 K-1,KMEIG 

SUM-SUM ^SQEIG( K ) *RD( K ) •SAVEXP(  ISUB) 
ISUB-ISUD+I 

IF  (ISUB.GT.DIM)  ISUB-ISUB-DIM 
3 CONTINUE 

SMPn(  I)-REAL(  SUM) 

SMPJ(  I ) -AIMAG(  SUM) 

2 CONTINUE 

RETURN 
ENTRY  SAMP I 


915  C PRECALCULATE  SQUARE  ROOTS  OP  EIGENVALUE  AND  NOISE  ARRAYS 


916 

917 

918 

919 


SONOS(  1:DIM)-SQRT( KOISE(  1:DIM)) 
SQniG{  1 tl’MEIG) -SORT(  EIG(  IsNMEIG)) 
RETURN 
END 
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920 


COMPLEX  FUECTIOII  CPJVNG(  XS  ) 


921 

922 

923 

924 

925 

926 

927 

928 

929 


C GEMEPATE  A COMPLEX  GAUSGIAII  PAtlDOM  IIUMBEP 

C IF  ( XS .GT.O) 

C XS  IS  IIEVJ  SEED 

C CRAKG  COMPUTED  FROM  XS  AMD  NEXT  NUf43ER  IN  SEQUENCE 

C on  IF  ( XS .EQ.0) 

C CRAKG  COMPUTED  FROM  NEXT  TVJO  NUMBERS  IN  SEQUENCE 

C OR  IF  (XS.LT.0) 

C CRAKG  IS  LAST  VALUE 

C END  IF 


930 

931 

932 

933 

934 

935 

936 

937 

938 

939 

940 

941 

942 

943 


DIMENSION  RHOC  4097) , ETHETAC  1026) 

COMPLEX  CRAKGI 
COMPLEX  SAVRND 

DATA  MULT/442C564089229/,X3/ .27305093327107640436/ 
DATA  ID6/-47/ ,NX2/1/ 

DATA  PI/3.14159265358979323/ 

IF  < XS)  1,2,3 

1 CONTINUE 
CRANG-SAVRND 
RETURN 

2 CONTINUE 

CALL  Q8MPYL(  MULT , X3 , DX7 ) 

CALL  D8PACK( IB6,DX7,R1) 

X3-DX7 


944  C 


)0.B(  FNAR- 


945 

946 

947 


CALL  Q8MPYL(  MULT,X3 ,DX7) 
CALL  Q8PACK( IB6 , DX7 , R2 ) 
X3-DX7 


948  C 


) 0 . 0(  FNAR' 


949 

950 

951 

952 


GOTO  4 
CONTINUE 

CALL  QORIOR(NX2,XS,X3) 
CALL  Q8PACIC<  IB6,X3,R1) 


953  C 


)SX(  FNAR- 


954 

955 

956 


CALL  Q8MPYL(  MULT,X3 ,DX7 ) 
CALL  QSPACia  IB6 ,DX7 ,R2 ) 
X3-DX7 


957  C 


)0.0(  FKAR- 


958  4 CONTINUE 


959 


C TABLE  LOOK  UP  FOR  ABSOLUTE  V.ALUE  OF  CRANG 


960 

961 


JRHO“IHT(  ni*4096 . +1 . ) 
RO-RnO(  JRHO) 


962 


C TABLE  LOOK  UP  FOR  ANGULAR  PART  OF  CR.ANG 
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963 

JTHE?A-It]T(  82*4096  . +2  . ) 

964 

IF  ( JTKETA.LE. 204  9 ) GOTO  5 

965 

IF  ( JTIIETA.  w;.  307  3 ) GOTO  6 

966 

C 

4TH  QUADRANT 

967 

JTnL-JTnr:TA-307  2 

968 

TnL-ETHETA(  JTRA) 

969 

JTIM-4099-JTHETA 

970 

TIM--ETHETA(  JTIM) 

971 

GOTO  7 

972 

6 

CONTINUE 

973 

C 

3RD  QUADRANT 

974 

JTRL-3075-JTHETA 

975 

TRL-'ETIIETAC  JTRL) 

976 

JTIM“JTHETA-2048 

977 

Tir4--ETHETA(  JTIM) 

978 

GOTO  7 

979 

5 

CONTINUE 

9 80 

IF  ( JTHETA.LE.1025)  GOTO  8 

981 

C 

2ND  QUADRANT 

982 

JTRL-JTHETA-1024 

983 

TRL--ETnETA(  JTRL) 

984 

JTIH-2051-JT1IETA 

985 

TIM“ETnETA(  JTIM) 

986 

GOTO  7 

987 

8 

CONTINUE 

988 

C 

1ST  QUADRANT 

939 

JTRL"1027-JTnETA 

990 

TRL-ETIinTA(  JTRJ^) 

991 

TIH-ETHETA{  JTHETA) 

992 

7 

CONTINUE 

993 

CRANG"RO*CHPLX(  TRL.TIM) 

994 

SAVRND“CRAHG 

995 

RETURN 

996 

ENTRY  CRANGKX) 

997 

CRAKG-CIIPLX(  0.  ,0.  ) 

998 

C 

INITIALISE  RIIO  AND  ETIIETA  FOR  TABLE  LOOK  UP 

999 

XX»-1./8192. 

1000 

XATjD"!  . /4096  . 

1001 

DO  9 J-1,4096 

1002 

XX"XX+XADD 

1003 

RHO( J) “SQRT(  -ALOG<  XX) ) 

1004 

9 

CONTINUE 

1005 

RHO(  4097)-RHO(  4096) 

1006 

XX--PI/4096 . 

1007 

XADD-PI/2048. 
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I(inr3 

1009 

1010 
1011 
1012 

1013 

1014 

1015 


DO  10  J-2,1025 
XX-XX+XADD 
ETHETA(  J)"SIll(  XX) 

10  COMTiriUE 

ETIIETA(  1 ) -ETirETA(  2 ) 
ETHETA(  102  6 )-ETIIETA(  1025) 
RETURN 
END 


I 
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1016 

SUBROUTIIIE  DIGP(  n,X,  JPriLEtl) 

1017 

C 

DEBUG  OUTPUT  ROUTIHE  FOP  I’^TPICES  V\T!03S  L0:;EP  HAUF  OI'LY  IS  STOP.rD 

1018 

c 

JTU  P.OVJ  OF  OUTPUT  COnnnSPOl.'.'JS  to  JTH  row  CT  .ACTUAU  I-.'TFIX  TO  THE 

1019 

C 

niGIIT  01'  THE  DIAGONAL  ELEMEIIT  ( SIGH  OF  IHAGIHARY  PART  REVERSED 

1020 

C 

FOR  HERI'IITIAIJ  ILATRIX) 

1021 

DIMEHSION  X(l) 

1022 

VJRITE  (6,100)  (J,J-1,10) 

1023 

100 

FOr.HATC  4X,  10112) 

1024 

ISUB-1 

1025 

DO  1 J-1,JPRLEH 

1026 

ILST-ISU3  + II-J 

1027 

VJRITE  (6,101)  J,CX(K)  ,K-ISUB,ILST) 

1028 

101 

FOnMAT(  1X,13 ,10E12 .5,/,(  4X,10E12 . 5) ) 

1029 

ISUB-ILST+l 

1030 

1 

CONTINUE 

1031 

RETURN 

1032 

END 

C-40 


420J  Lexington  Avenue  North 
Arden  Hills.  Minnesota  551 1 2 
612/482-2100 


CONTRpL  DATA 
CORJ^DRATION 

riarch  31i  1^77 


hr.  James  Demmel 
Technology  Service  Corp. 

Data  Sciences  Division 
sail  Uilshire  Blvd. 

Santa  Monicai  California  TOlDB 

Dear  Hr-  Demmeli 

I have  been  asked  to  reply  to  your  letter  of  Harch  11t  1177  to 
Hr-  Dennis  Kuba- 

First  1 in  order  to  effect  the  Choleski  decomposition  for  a 
complex  matrix  it  is  required  to  do  S/3  + 0 -CNIt  arithmetic 

operations-  If  it  is  necessary  to  do  it  in  T-ID"^  sec-  { a few 
milliseconds}  for  a SOD  x SOG  matrix-i  one  needs  a computer 
capable  of  S/3  -6-10^  / T-IO'^  FLOPS  = bGO  megaflops-  This  is 
without  any  consideration  of  "overhead"-  There  is  no  computer 
presently  capable  of  such  performance-  Indeed  not  ^en.  the 
M pipe  STAR  IGGC  will  be  capable  of  such  performance  on  a 
SOD  X SDG  matrix- 

Let  me  start  with  one  general  comment-  The  number  of  cycles 
needed  to  perform  the  decomposition  is  a cubic  polynomial-  Host 
of  the  points  you  raise  will  affect  only  the  linear  term  or 
weakly  affect  higher  order  terms - 

As  for  your  specific  questions  about  the  compiler!  it  is  relatively 
naive  compared  to.  the  optimizing  FORTRAN  compilers  for  IBH  3tiQ/37D 
or  CDC  7bGG's-  The  best  way  to  get  the  answers  to  your  compiler 
questions  is  to  get  an  assembly  listing  and  register  map  by 
exercising  the  proper  options  on  the  FORTRAN  card- 

Some  things  you  should  check  for  are: 

•Cl}  Loads  and  stores  degrade  performance  greatly-  Descriptor 
loads  can  impede  vector  operations.  Rewriting  assembly 
code  to  get  bottom  loadi  top  store  code  is  often  advantag- 
ous-  with  this  type  of  code  the  load  can  be  "hidden" 
behind  branch  time- 
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-C2>  Fill  the  register  file  as  full  as  possible  with  descriptors 
and  index  variables  eliminating  the  need  for  load/store. 

■£3>  Explicit  dyadicization  of  vector  arithmetic  cuts  down 
compiler  generated  overhead. 

{4}  As  for  the  back  substitution  phase-i  if  you  have  very 

many  RHS'sn  all  known  at  oncei  one  can  vectorize  along 
the  number  of  RHS's  and  perform  "simultaneous  solution". 

-[S>  Perhaps  you  could  pack  the  new  starting  points  in  your 
dynamic  inner  loop  descriptors  since  the  lengths  are 
invariant  inside  the  loop. 

The  basic  vector  algorithm  you  used  is  the  most  efficient  one 

I know  of  and  hence  I can  be  of  no  help  there. 

I should  be  in  the  Los  Angeles  area  sometime  in  the  next  bO  days. 

Perhaos  we  could  meet  and  discuss  your  problem  more  fullyi 

especially  the  plans  for  the  STARIDDC. 

I hope  I have  been  of  some  assistance. 


Regards! 


Dr.  ff.  J.^Kascic 

STAR  Consulting  Services 

STAR  Operations  Division 

njK/dm 
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Technology  Service  Corporation 

DaU  Sciences  Division 

2811  Wilshire  Boulevard,  Santa  Monica.  California  <KH01  Phone:  (213)  829-7411 

6 April  1977 


Dr.  M.  J.  Kascic,  Jr. 

Consultant 

STAR  Operations  Division 
4290  Fernwood  Avenue 
St.  Paul,  Minnesota  55112 

Dear  Dr.  Kascic: 

Your  letter  discussing  our  algorithm  for  the  Choleski  decomposition  of 
a positive  definite  Hermitian  matrix  arrived  yesterday,  and  I wish  to 
thank  you  for  your  time  and  effort.  Your  comments  were  generally 
clear  and  helpful,  but  I do  have  a question  about  your  third  numbered 
comment: 

Explicit  dyadicization  of  vector  arithmetic  cuts  down  compiler 
generated  overhead. 

Do  you  mean  to  use  expanded  expressions  like 

DTEMPl  = DA*DB 
DTEMP2  = DC*DD 
DSUM  = DTEMPl  + DTEMP2 

instead  of 

DSUM  = DA*DB  + DC*DD 

(where  all  variables  are  descriptors),  or  to  use  explicit  vector  references 
like  SUM  (1;LEN)  instead  of  assigning  a descriptor  and  using  the  descriptor? 

We  would  be  very  happy  to  have  you  call  or  come  in  to  TSC  to  talk  to  us 
when  you  are  in  Los  Angeles.  We  look  forward  to  discussing  plans  for  the 
STAR  100  C. 

Yours  truly, 

/ 

James  Detimel 
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A2Q}  \-  xjn^on  Avemje  North 
Arden  Hills.  Minnesota  551 1 2 
612/482  2 ICX) 


CONTRpL  DATA 

cortoration 


April  2tn  1177 


Mr.  James  Demmel 
Technological  Service  Corp- 
Uilshire  Blvd- 
Santa  Monicas  CA-  1QMn3 

Dear  Mr.  Demmels 

I am  pleased  that  I have  been  of  help.  To  answer  your  question 
about  temporaries s a statement  such  as: 

DSUM  = DA*DB  + DC*DD 

should  be  expanded  to: 

DTEMP  = DA»DB 
DSUM  = DC*DD 
DSUn  = DSUM  + DTEflP 

This  forces  only  one  temporary  to  be  constructed.  In  generals 
gives  an  even  more  complicated  set  of  algebraic  statementss  there 
is  usually  and  "irreducible"  number  of  temporaries  needed.  The 
compiler  does  not  recognize  this  fact.  Indeed  the  statement: 

DC  = DC  + DA  + DB 

will  compile  into: 

TEMP  = DA  + DB 
DC  = DC  + TEMP 

Clearly  it  is  better  to  have: 

DC  = DC  + DA 
DC  = DC  + DB 

Please  let  me  know  if  I can  be  of  any  further  assistance. 

Regards  s 

CONTROL  DATA  CORPORATION 

Dr.  M.  J.  Kascic 

STAR  Consulting  Services 

MJK/dm 
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Appendix  D.  CRAY-1  Software 
SUMMARY  AND  DOCUMENTATION 


As  discussed  in  Section  4. 3. 3. 2,  the  algorithm  and  implementation  chosen 
for  the  CRAY-1  are  essentially  the  same  as  those  chosen  for  the  CDC  STAR-100. 
Hence  the  documentation  of  the  STAR  is  essentially  correct  for  the  CRAY  except 
for  one  thing:  the  CRAY  supports  no  explicit  vectorization  in  its  FORTRAN. 

The  compiler,  instead,  examines  DO  loops  in  the  program  for  suitability  for 
vectorization.  The  program  reads  like  standard  FORTRAN.  We  have  indicated 

where  these  vectorizable  loops  are  in  the  program  with  comment  statements. 

The  system  routines  used  for  timings  are  also  different  from  the 

one  on  the  STAR,  and  are  called  SECOND  (CPU  time)  and  RTC  (real  time). 


1 


SUBROUTINE  COVCMP 


2 

C 

UPDATE  SAMPLE  COVARIANCE  MATRIX 

3 

COMMON  /WORK/  MATDIM.jyiTOLD  ,MTDMM1  ,MTD!!:i2  ,»TDMP1  ,MTDMP2  ,SI2EM1 

• RS.JS.REELC  20100) 20100) , R(  20100) ,C(  20100) ,£REAL(  200) , 

• EMAG(  200) ,RECIP(  200) ,TEMPR<  200) ,TEMPJ<  200) 

6 

INTEGER  SIZEMl 

7 

REAL  JS.JMAG 

8 

COMMON  /TIME/  TDECMP , TBKSBl , TBKSB2 , TCVCMP .WDECMP , WBKSBl , WBKSB2 
• VJCVCMP.TSETUP.WSETUP 

10 

INTEGER  CSTR.CENO.SSUB 

11 

LOP-200/MATDIM 

12 

W1-RTC(  DUM) 

13 

CALL  SEC0ND{T1) 

14 

DO  1 J-l,LOP 

15 

1 

CONTINUE 

16 

CALL  SECOND(T2) 

17 

V72-RTC(  DUM) 

18 

CALL  SECOND(  T3 ) 

19 

DO  2 J-l,LOP 

20 

CSTH-1 

21 

MTDMMI-MTDMMl 

22 

CEND-MATDIM 

23 

SSUB-0 

24 

DO  3 I-l.MATDIM 

25 

SR-TEMPR(  I) 

26 

SJ-TEMPJ(  I) 

27 

c»* 

•••  VECTOR  OPERATIONS 

28 

DO  4 JVEC-CSTH,CEND 

29 

R(  JVEC)  -R(  JVEC)  +SR*TEHPR(  JVEC-SSUB)  ■••SJ*TEMPJ(  JVEC-SSUB) 

30 

C(  JVEC)-C(  JVEC)-SR»TEMPJ(  JVEC-SSUB) +SJ«TEMPR(  JVEC-SSUB) 

31 

4 

CONTINUE 

32 

CSTR-CEND+l 

33 

C END - C END ♦MTDMMI 

34 

SSUB-CEND-MATDIM 

35 

MTDimi -MTDMMI - 1 

36 

3 

CONTINUE 

37 

2 

CONTINUE 

38 

CALL  SECOMD( T4 ) 

39 

W3-RTC(  DUM) 

40 

C 

CALCULATE  CPU  AND  WALL  CLOCK  TIMES  IN  MILLISECONDS 

41 

TCVCMP-l  T4-T3-T2  + T1)  *1000 . /FLOATC'LOP ) 

42 

WC  VCMP-(  V/3  - 2 , *W2  +W1)*12.SE  - 6 /FLOAT(  LOP  ) 

43 

RETURN 

44 

END 
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SUBROUTINE  CHOLDC 


45 

46 

C 

47 

C 

48 

C 

49 

C 

50 

53 

54 

55 

57 

58 

59 

60 

61 

C 

62 

C 

63 

C 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

C 

74 

75 

7 6 

77 

1 

78 

C 

79 

c 

80 

c 

81 

32 

83 

84 

500 

85 

86 

10 

87 

83 

89 

c 

90 

c 

91 

C 

PERFORM  CnOLESKY  LL*  DECOMPOSITION  OF  A POSITIVE  DEFINITE 
HERMITIAN  MATRIX 


COmON  /VroRK/  MATDIM.MATOLD  .MTDMMI  .MTDI'IMI  .MTDMPl  ,MTDMP2  .SIZEMl , 

• RS.JS.REEU  20100) ,JMAG( 20100) , R( 20100) ,C(  20100) , EREAL(  200) , 

• EMAG(  200) ,RECIP(  200) ,TEMPR(  200) ,TEMPJ(  200) 

INTEGER  SIZEMl 

REAL  JS.JMAG 

COMMON  /TIME/  TDECMP, TBKSBl , T8KSB2 , TCVCMP,WDECMP,WBKSB1 ,WBKSB2 , 

• V;CVCHP,TSETUP,WSETUP 
INTEGER  SSUB 
INTEGER  SUBCOL 
V/1-RTC(  DUM) 

CALL  SECOND) Tl) 


SET  UP  ARRAY  DESCRIPTORS 


IF  (MATDIM.EQ.MATOLD)  GOTO  1 
MATOLD-MATDIM 
MTDMMI “MATDIM-1 
MTDMM2  -MJVTDIM-  2 
MTDMP1“MATDIM+1 
MTDMP2"t’ATDIH+2 
SIZEMl=MATDIM*(MATDIM+l)/2-l 
CALL  SECOND) TZ) 

W2-RTC(  DUM) 

CALCULATE  CPU  AND  V/ALL  CLOCK  TIME  IN  MILLISECONDS 

TSETUP  = ( T2-T1)*1000. 

WSETUP-) W2-Wl)«12.5E-6 
RETURN 
CONTINUE 


CALCULATE  RECIPROCAL  OP  FIRST  DIAGONAL  ELEMENT 


IF  ( REEL) 1) .GT. 0.0)  GOTO  10 
N0NE”1 

V.’RITE  (6,500)  M0NE,REEL(1) 

FORM.ATC  IGHOEHROR  IN  CIJOLDC  , 14 , 18H  DIAGON.AL  ELEMENT- , E12 . 5 ) 
RETURN 
CONTINUE 

XTE.MP'l  . /SQRT(  REEL(  1 ) ) 

RECIPC 1 ) -XTEMP 


CALCULATE  FIRST  COLUMN 
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92 

•VECTOR  OPERATIONS 

93 

DO  1001  JVEC'l.MATDIH 

94 

REELC  JVEC ) •REEI.(  JVEC ) •XTEMP 

95 

JMAG(  JVEC ) - JMA(3(  JVEC ) 'XTEMP 

96 

1001 

CONTIMVE 

9V 

C 

98 

C 

CALCULATE  COLUMNS  2 THRODOB  MATDIH'l 

99 

c 

100 

ILAST-MATDIM 

101 

LENMl'MTDMMl 

102 

JCOLMl-0 

10'J 

DO  3 JC0L>2 .MTDMMl 

104 

IPIRST-lLAST+l 

105 

ILAST'ILAST'LENHl 

106 

LENMl-LENMl-1 

107 

ISUB-JCOL 

108 

JCOLMl‘JCOLMl'1 

109 

c 

110 

c 

SUBTRACT  MULTIPLES  OP  PREVIOUS  COLUMNS 

111 

c 

112 

DO  4 SUBCOL-l.JCOLMl 

113 

TR--REEL(  ISUB) 

114 

TJ-JMAa(  ISUB) 

115 

SSUB-IFIRST-I8UB 

116 

'•  VECTOR  OPERATIONS 

117 

DO  1002  JVEC-1PIHST,ILAST 

118 

RKEL(  JVEC) -REEL( JVEC) ♦TR*REEL( JVEC-SSUB) -TJ*JMAG(  JVEC-SSUB) 

119 

JHAGC  JVEC) -JMA6( JVEC) ♦TJ»REEL(  JVEC-SSUB) ♦TH*JMAG(  JVEC-SSUB) 

120 

1002 

CONTINUE 

121 

ISUB*ISUB«MATDIM-SUBCOL 

122 

4 

CONTINUE 

123 

C 

124 

c 

CALCULATE  RECIPROCAL  OF  DIAGONAL  ELEMENT 

125 

c 

126 

IF  ( REEL(  IFIRST) .GT.0.0)  GOTO  11 

127 

WRITE  (6,500)  JCOL,REEL(  IFIRST) 

128 

RETURN 

129 

11 

CONTINUE 

130 

XTEMP-1 . /SQRT<  REELC IFIRST) ) 

131 

RECIP(  JCOL) -XTEMP 

132 

c 

133 

c 

CALCULATE  COLUMN 

134 

c 

135 

• VECTOR  OPERATIONS 

136 

DO  1003  JVEC-IFIRST,1LAST 

. 
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137 

138 

139 

140 


EEE1.(  JVnC)-REEL(  JVEC)»XTEMP 
JMAG(  JVEC ) - JMAG( JVEC ) *XTEMP 
1003  CONTIIIUE 
3 CONTINUE 


141  C 

142  C CALCULATE  LAST  DIAGONAL  ELEtlENT 

143  C 


144 

145 

146 

147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

160 
161 


SUM-0. 

ISUB-MATDIM 
DO  7 SUBC0L-1,MTDMM1 
XT-REEL( ISUB) 

XI-JMAG(  ISUB) 
SUM-SUM+XT»XT+XI*XI 
ISUB-ISUB+MATDIM-SUBCOL 
7 CONTINUE 

ASUM-REEL(  ISUB) -SUM 
IF  ( ASUM.GT.0.0)  GOTO  12 
WRITE  (6,500)  MATDIM.ASUM 
RETURN 
12  CONTINUE 

RnCIP(  MATDIM) -1 . /SQRT(  ASUM) 
RS-REEL(  SIZEMl) 

JS-JMAG(  SIZEMl) 

CALL  SEC0ND(T2) 

W2-RTC(  DUM) 


% 


162  C CALCULATE  CPU  AND  WALL  CLOCK  WIMES  IN  MILLISECONDS 


163 

164 

165 

166 


TDECMP-(  T2-T1)*1000. 
WDECMP-{  VJ2-W1)  *12 . 5E-6 
RETURN 
END 
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167 


SUBROUTINE  BAKSUB 


168  C 

159  C GIVEN  THE  LL*  DECOMPOSITION  OF  A MATRIX,  AND  A VECTOR  S,  PERFORM 

170  c rv;o  back  substitutions  to  soi,ve  (el*)W-s  for  w 

171  C 


172 


175 

176 

177 

179 

180 
181 


COMMON  /VIORK/  MATDIM,MATOLD  ,MTDMM1  ,MTDJ1M2  ,MTDMP1  ,MTDMP2  , SIZEMl , 

• RS  , JS  , r.EEU  20100)  ,JMAG(  20100)  , n(  20100)  ,C(  20100)  ,EREAL(  200)  , 

• EMAG(  200) ,RECIP(  200) .TEMPRl  200) ,TEMPJ( 200) 

INTEGER  SIZEMl 

REAL  JS.JMAG 

COtmON  /TIME/  TDECMP.TBKSBl ,TBK3B2 .TCVCMP.WDECMP.lTBKSBl ,WBKSB2 , 

• V.'CVCMP.TSETUP.WSETUP 
INTEGER  SADD 

Wl-RTCC  DUM) 

CALL  SECOND! Tl) 


182 

183 

184 

185 

186 


C 

C 

C 

c 

c 


PERFORM  FIRST  BACK  SUBSTITUTION,  (L)TEMP“S 
CALCULATE  FIRST  ELEMENT  OF  TEMP 


187 

188 
189 


XMUL-RECIP!  1) 

TEMPR(  1 ) -EREAL!  1 ) ‘XMUL 
TEMPO!  1 ) -EMAG!  1 ) "XMUL 


190  C 

191  C SUBTRACT  MULTIPLE  OF  FIRST  COLUMN  OP  L FROM  S AND  STORE  IN  TEMP 

192  C 

193  C* ••••VECTOR  OPERATIONS 


194 

195 

196 

197 


DO  10O1  JVEC-2,MATDIM 

TEMPR!  JVEC)"EREAL!  JVEO-TEHPR!  1 ) •REEL!  JVEC)+TEMPJ(  1 ) *J!nG!  JVEC) 
TEMPO!  OVEC ) -EMAG!  OVEC  ) -TEMPO!  1 ) "REEL!  OVEC ) -TEMPR!  1 ) *OMi\G(  OVEC ) 
1001  CONTINUE 


198  C 

199  C CALCULATE  ELEMENTS  2 THROUGH  lLATDIM-2 

200  C 


201  SADD-MTDMMl 

202  DO  1 OCOL-2 ,MTDMM2 


203  C 

204  C CALCULATE  OCOL-TII  ELEMENT 

205  C 


206 

207 

203 

209 


OCOLPl-OCOL+1 
XMUL-RECIP!  OCOL) 

TEMPR!  OCOL)  -TEriPRI  OCOL)  •XMUL 
TEMPO!  OCOL) -TEMPO! OCOL) •XMUL 


210  C 

211  C SUBTRACT  MULTIPLE  OF  OCOL-TH  COLUMN  PROM  TEMP 
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212  C 

213  TR--TEMPR(  JCOL) 

214  TJ-TEMPJ(  JCOL) 

215  VECTOR  OPERATIONS 

216  DO  1002  JVEC-JCOLPl,MATDIM 

217  TEMPR(  JVEC) -TEMPR( JVEC) ♦TR*REEL{ JVEC+SADD) +TJ*JMAG(  JVEC+SADD) 

218  TEMPJ(  JVEC ) -TEMPJ( JVEC ) -TJ*REEL( JVEC+SADD) +TH*J)«G(  JVEC+SADD) 

219  1002  CONTINUE 

220  SADD-SADD+MATDIM-JCOL 

221  1 CONTINUE 

222  C 

223  C CALCULATE  ELEMENT  MATDIM-1 

224  C 

225  XMUL-RECIP(MTDMMl) 

22  6 TEMPR(  MTDiail ) -TEMPR(  MTDMMl ) ‘XMUL 

22  7 TEMPJ(  MTDMMl ) -TEMPJ(  MTDMMl ) 'XHUL 

228  C 

229  C COMBINE  LAST  STEP  OP  FIRST  BACK  SUBSTITUTION 

230  C (CALCULATION  OF  ELEMENT  MATDIM)  WITH  FIRST  STEP  OF  SECOND 

2 31  C BACK  SUBSTITUTION  ( CAI,CULATION  OF  ELEMENT  MATDIM  OP  W WHERE 

232  C (L*)»W-TEMP  AND  W IS  STORED  IN  TEMP) 

233  C 

234 

235 

236 

237 

238 

239  C 

240  C PERFORM  SECOND  BACK  SUBSTITUTION,  (L*)W-TEMP,  STORING  W IN  TEMP 

241  C 

242  CALL  SECOND(T2) 

243  W2-RTC(DUM) 

244  CALL  SECOND(T3) 

245  C 

246  C CALCULATE  MATDIM-l-ST  ELEMENT 

247  C 

248 
24  9 

250 

251 

252 

253  C 

254  C CALCULATE  ELEMENTS  MATDIM-2  THROUGH  1 

255  C 


XMUL"RECIP(  MTDMMl) 

Tn"TFM?n(  imTDIM) 

TJ“TEMPJ(  MATDIM) 

TEMPn(  MTDMMl ) "(  TRl -TH+RS-TJ* JS ) »XHUL 
TEMPJC  MTDtUll ) "( TJ1-TJ*RS  + TR*J3 ) *XMUL 


XMUL-REC1P(  imTDIM)»*2 
Tnl-TEMPR{  MTDMI41 ) 

TJ1-TEMPJ(  HTDI4M1) 

TEMPRt  MATDIM)  "(  TEMPRC  MATDIM)  -TRl*nS+TJl»JS)  *Xt4UL 
TEMPJ(  MATDIM) -(  TEMPJ(  MATDIM) -TJ1*RS-TR1*JS ) *XMUL 


2Z6 

257 

258 

259 

260 
261 


SADD-SISE!a-MATDIM 

jp,ov;-mtd:lmi 

DO  2 JTEMP-2,MTDmil 
JEoivpi-jnow 
jnOV7-JROW-l 
SADD-SADD-JTEMP 


262 

263 

264 

265 

266 


C 

C CALCULATE  DOT  PREDDCT  OF  JROV7-TH  HOW  STAHTIIIG  AT  COLD.'IN  JROW*l 

C WITH  TEMP  STARTING  WITH  THE  JROW+l-ST  ELEMENT 

C 

C***»*  VECTOR  OPERATIONS,  DOT  PRODUCTS 


267 

263 

269 

270 

272 

274 


DOTR-0. 

DOTJ-0. 

DO  1003  JVEC“JROV«>l,MATDIM 

DOTR-DOTR*TEMPR( JVEC)*nEEL(  JVEC+SADD ) +TEHPJ(  JVEC ) • 

* JMAG(  JVEC  + SADD) 

DOTJ-DOTJ+TEMPJ{  JVEC) •REEL( JVEC  + SADD ) -TEMPR(  JVEC) * 

• JMAG(  JVEC+SADD) 

1003  CONTINUE 


275  C 

276  C CALCULATE  JROW-TH  ELEMENT 

277  C 


278 

XMUL-RECIP(  JROV7) 

279 

TEMPR(  JROW) -(  TEMPR(  JROW) -DOTH) ‘XHUL 

280 

TEMPJ(  JROW) -(  TEMPJ(  JROW) -DOTJ ) “XHUL 

281 

2 

CONTINUE 

232 

CALL  SECOND(T4) 

283 

W3-RTC(  DUM) 

284  C CALCULATE  CPU  AND  WALL  CLOCK  TIMES  IN  MILLISECONDS 


285 

286 

287 

288 

289 

290 


TBKSBl-(  T2-T1)»1000. 
TBKSB2-( T4-T3)*1000. 
WBKSBl-(  W2-Wl)*12.5E-6 
™iCSB2-(  W3-W2  ) *12 . 5E-6 
RETURN 
END 
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291 


PnOGRAM  SYSTST(  TAPE6-OUTPUT) 


292 

293 

294 


C TEST  ADAPTIVE  ARRAY  SYETEfl  ON  CRAY-1 

C VECTOR  OPERATIOMD  ARE  ITniCATED  BY  COMMENT  CARDS  V;iTH 

C IIJ  COLUMNS  1 THROUGH  6 


295 


298 

299 

300 

302 

303 

304 

305 


309 


COIIMON  /VJORK/  HATDIM.IIATOLD  ,MTDIIM1  ,!ITDMM2  .MTDMPl  ,KTri'.P2  , SI2EM1 , 

* nS,JS,REEL(  20100) ,JMAG(  20100) ,R( 20100) ,C(  20100) ,EnEAL(  200) , 

» EfIAG(  200)  ,RECIP(  200)  ,TE.MPn<  200)  ,TEMPJ(  200) 

INTEGER  SIZEHl 
REAL  JS.JMAG 

COMIION  /TIME/  TDECMP , TBKSDl , TBKSD2  , TCVC.M? , VTOECMP  ,VJBKSB1  ,VmKSB2  , 

* V7CVCMP,TSETUP,WSETUP 

DATA  EnEAL/20O"l . / , EMAG/200"! . / 

MATDIM-0 
WRITE  ( 6,100) 

100  F0H11AT(41H1  HUMDER  TIME  FOR  TIME  FOR  TIME  FOR, 

•50H  TIME  FOR  TIME  TO  TIME  TO  TIME  TO  MAX  TOTAL,/, 

•5 in  SAMPLES  SET  UP  DECOMP  BACKSUB  1 BACKSUB  2, 

•40H  MOVE  2EP.O  OUT  UPDATE  TIME) 

1 CONTINUE 


310 


GET  NEXT  DIMENSION 


311 

312 


MATDIM-NEXDIMC  MATDIM) 

IP  ( I4ATDIM.EQ.0)  STOP  777 


313 


TEST  REINITIALIZATION 


314 

315 

316 


LEN-MATDIM*<  MATDIM+l)/2 
W1-RTC(  DUM) 

CALL  SECOHD(Tl) 


317  C»»***VECTOH  OPERATION 

318  DO  2 JVEC-1,LEN 

319  REELC JVEC)-1. 

320  JMAG(  JVEC)-1, 

321  2 CONTINUE 

322  CALL  SEC0ND(T2) 

323  V;2-RTC(DUM) 


324  C CALCULATE  CPU  AND  WALL  CLOCK  TIMES  IK  MILLISECONDS 


325 

326 


TZERO»( T2-T1)*1000. 
WZERO-(  V/2-W1 ) *12 . 5B-6 


327  C TEST  MOVING  VECTORS 


328 

329 


W1"RTC(  DUM) 

CALL  SECOND(Tl) 


330  C****«VECTOR  OPERATIONS 


331  DO  3 JVEC-1,LEH 

332  R(  JVEC) "REELC JVEC) 
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3 33  C(  JVEC > - JMAG( JV£C ) 

334  3 CONTINUE 

335  CALL  SECOND(T2) 

336  VJ2-RTC(DUM) 

337  C CALCULATE  CPU  AND  WALL  CLOCK  TIMES  IN  MILLISECONDS 

338  TMOVE-(  T2-T1)»1000. 

339  VmoVE-(  W2-391)*12,5E-6 

340  C TEST  SET  UP  OF  POINTERS 

341  MATOLD-MATDIM-1 

3 4 2 CALL  CIIOLDC 

343  C FILL  SAMPLE  VECTORS 

344  C***»*VECTOR  OPERATIONS 

345  DO  5 JVEC-1,MATDIM 

346  TEMPR(  JVEC)-1. 

34  7 TEMPJ(  JVEC)“1. 

348  5 CONTINUE 

34  9 C TEST  UPDATE  OP  COVARIANCE  14ATRIX 

350  CALL  COVCMP 

351  C FILL  UP  MATRIX  WITH  POSITIVE  DEFINITE  HEHMITIAN  DATA 

352  ISUB-1 

353  DUM-10.*PLOAT(MATDIM) 

354  DO  4 J“1,MATDIM 

355  REEL( ISUB)-DUM 

356  JMAG( ISUB)-0. 

357  ISUB-ISUB+MATDIM-J+1 

358  4 CONTINUE 

359  C TEST  DECOMPOSITION 

3 60  CALL  CIIOLDC 

361  C TEST  BACK  SUBSTITUTIONS 

362  CALL  BAKSUB 

363  C CALCULATE  TIMES 

364  TI4AX-TSETUP+TDECMP+TBKS131  + TBKSB2  + TMOVE+T2ERO+2  . *FLCAT(  MATDIM)  * 
*TCVCMP 

365  ramX-I7SETUP+WDECMP+WBKSDl+V7BKSB2+V,?iOV3+WZERO  + 2 . "'PLOAT(  MATDIM)  • 
'■V.’CVCMP 

368  WRITE  {6,101)  MATDIM.TSETUP  , TDECIIP  , TBKSBl , T3KSB?. , THOVE , TZEEO , 

»TCVCMP  , TMAX  ,V7SETUP  ,V)DECMP  .WBKGBl  .VJBKSGl  ,WMOVE  .VIZEP-O  /rfCVCMP  ,W!IAX 
370  101  FOPE!riT(  in0,I10,CF10.3,DX,13nCPU(niLLI3EC)  , / , 1 IX , OFIU . 3 , SX , 

•14HWALL(  MILLISEC) ) 

372  GOTO  1 

373  END 
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374 


runcTion  iiexdiikn) 


375 

C 

GENERATE  DIIIENGIOHG 

376 

IP  (N.GT.0)  GOTO  1 

377 

NEXDIM-20 

378 

RETURN 

379 

1 

CONTINUE 

3 80 

IF  ( N.GE.50)  GOTO  2 

381 

HEXDIM-N+5 

382 

RETURN 

383 

2 

CONTINUE 

384 

IF  (N.GE.200)  GOTO  3 

335 

NEXDIM-W+10 

386 

RETURN 

387 

3 

CONTINUE 

388 

NEXDIM-0 

389 

RETURN 

390 

' END 

SYSTEM 


1 
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Appendix  E.  Conplex  Multiplication 

The  problem  is  to  multiply  two  complex  numbers  a+bi  and  c+di  to  obtain 
the  product  in  the  "best"  way. 

"Best"  is  determined  not  only  by  the  operations  count,  but  depends  on 

1)  the  relative  times  it  takes  the  machine  to  do  an  addition  (+)  and 
a multiplication(*) 

2)  the  computer  architecture  (vector  processors,  like  the  CDC  STAR, 
parallel  processors  like  the  ILLIAC  IV,  machines  like  the  CDC  7600  which  overlap 
execution  of  instructions,  and  any  unmentioned  or  as -yet-unbuilt  combination 

of  the  above) 

3)  number  of  complex  multiplications  to  be  performed  (since  algorithms 
require  different  amounts  of  temporary  storage,  which  may  be  large  on  a 
vector  machine). 

All  these  considerations  are  machine- and  problem-dependent.  In  fact, 
the  third  one  could  result  in  one  program  using  two  different  algorithms, 
if  different  numbers  of  complex  numbers  are  to  be  multiplied  at  different 
times. 

Just  on  the  basis  of  operation  counts,  the  following  two  methods  seem 
best  (where  a+bi  and  c+di  are  the  multiplicands  and  P^+P^i  is  the  product): 

1)  Assuming  an  identifier  can  appear  on  only  one  side  of  the  equal 

sign: 

1.1  t-j  = a*c 

1.2  t^  = b*d 

i • 3 P^  ” ^1  ” ^2 
1.4  t^  = b*c 
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1.5  t2  = a*d 

1.6  p.  = + t2 

§ multiplies  = 4 

if  additions  and  subtractions  = 2 
if  temporaries  = 2. 


Assuming  an  identifier  can  appear  on  both  sides  of  the  equal  sign: 

2.1  p^  - a*c 

2.2  = b*d 

2.3  = Pr  + 

2.4  p^  = b*c 

2.5  t^  = a*d 

2.6  P^  = P^  + 

a multiplies  = 4 

if  additions  and  subtractions  = 2 
if  temporaries  = 2 . 


2)  Assuming  an  identifier  can  appear  on  only  one  side  of  the  equal  sign: 


3.1 

3.2 


3.3 


3.4 


3.5 


3.6 


3.7 


3.8 


t-j  = a + b 
t2  = c - d 

^3  ^ ^1*^2 
t^  = a*d 

t2  - b*c 

p,  ■ t,  + 

'4  ' *3  - ‘2> 

Pr  ■ '4  * ‘l' 


Pr  * ‘3  - ‘2 
^3  " Pr  ^1 
Pr  = ^ 


E-2 


# multiplies  = 3 

# additions  and  subtractions  = 5 

# temporaries  = 4 or  3 . 

3)  Assuming  an  identifier  can  appear  on  both  sides  of  the  equal  sign: 

4.1  p.„  = a + b 
•^r 

4.2  p^  = c - d 

4.3  t^  = P^*P^ 

4.4  t^  = a*d 

4.5  p^  = b*c 

4.6  P^  = t2  + p^ 

4.7  « t,  - 

4.8  P^  = P^  + 

i multiplies  = 3 

# additions  and  subtractions  = 5 

# temporaries  = 2. 


Some  vector  machines  may  not  allow  a vector  identifier  to  appear 
on  both  sides  of  the  equal  sign,  which  is  the  reason  for  two  separate 
implementations  of  the  same  algorithm.  Also,  the  number  of  tempor- 
aries is  of  interest  to  people  using  assembly  language  or  an  optimizing 
compiler  which  tries  to  keep  all  temporaries  in  registers,  of  which  there 
are  only  a limited  number.  A user  using  a machine  which  overlaps  instructions 
may  wish  to  use  more  temporaries  and  allow  more  time  between  an  instruction 
which  uses  a result  of  a previous  instruction  in  order  to  allow  an  instruction 
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to  finish.  Algorithm  1,  for  example,  might  look  like  this  on  a CDC  7600: 


1.1' 

t^  = a*c 

1.2' 

t2  = b*d 

1.3' 

1 

11 

O. 

^2 

1.4' 

tg  = a*d 

1.5' 

t^  = b*c 

1.6' 

Pi  ■ ‘3  * 

^4 

This  implementation  uses  two  more  temporaries  but  allows  the  multiply  units 
to  begin  work  in  line  1.4'  before  the  addition  unit  is  done  in  line  1.3'. 
Algorithm  2 could  change  to 

3.1 ' t^  = a*d 
3.2'  t2  = a + b 
3.3'  tj  = b*c 
3.4'  t4  = c - d 
3.5'  tg  = t2*t^ 

3.6  Pj,  ” ^1  ^ ^3 
3.7'  t2  = tg  - t3 
3.8'  Pi  = t2  + t^ 

where  5 temporaries  are  used  instead  of  4.  Here  there  are  only  3 instructions 
which  must  wait  for  the  previous  instruction  to  finish, whereas  in  the  original 
implementation  there  were  7. 

To  compare  these  two  algorithms  from  an  operation  counts  point  of 
view,  let  t,  be  the  number  of  seconds  required  to  perform  an  addition  or 

a 
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subtraction  and  tj^  be  the  time  required  to  perform  a multiplication. 

Algorithm  1 requires  4t|^  + 2t^  seconds  and  Algorithm  2 requires  St^^^  + St^ 

seconds,  so  Algorithm  1 is  faster  when  and  Algorithm  2 is  faster 

when  t > 3t, . 
m a 

To  show  how  the  analysis  changes  from  machine  to  machine,  we  consider 
a parallel  processor  with  4 independent,  communicating  processors,  each  of 
which  performs  an  operation  and  waits  for  the  others  to  finish.  In  this 
case  Algorithm  1 becomes 

1)  t^  = a*c  t2  = b*d  tg  = b*c  t^  = a*d  (simultaneous) 

2)  Pj-  ~ " ^2  ^i  " ^3  ^4  (simultaneous) 

for  a total  time  of  t^  + t^j^  with  4 temporaries;  whereas, Algorithm  2 becomes 

1)  t^  = a*d  tg  = b*c  tj  = a+b  t^  = c-d  (simultaneous) 

2)  = t^  + t2  tg  = t3*t^  ^6  " ^1  ■ ^2  (simultaneous) 

^r  “ ^5  ^ ^6 

for  a total  time  of  2 t + t,  with  6 temporaries,  obviously  more  than 

in  o 

Algorithm  1. 

It  can  be  shown  that  the  multiplication  of  2 complex  numbers  requires 
at  least  3 multiplications*  but  we  know  of  no  results  for  the  minimum  number 
of  additions.  The  two  algorithms  mentioned  above,  however,  are  the  best 
we  have  found. 

On  the  computers  of  interest,  the  four-multi  ply-two -add  method  is  always 
faster  except  for  integers  on  the  STARAN.  The  only  add  and  multiply  times 
we  have  now  are  for  16- bit  numbers.  The  current  STARAl,  has  timings  of 

★ 

Aho,  A.V.,  J.  E.  Hopcroft,  J.  D.  Oilman,  The  Design  and  Analysis  of 
Computer  Algorithms,  Addison-Wesley,  New  York,  1974,  Chapter  12. 
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19  ^isecs  and  283  ysecs  for  add  and  multiply,  respectively.  The  STARAN-E  system 
has  timings  of  11.3  psecs  and  119  psecs. 
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Appendix  F.  Implementations  of  Algori_thms  for  Direct 
Methods  of  Solving  MW  = S on  Sequential 
Processors 

All  the  implementations  are  written  in  an  easily  understood  structured 
programming  language.  These  implementations  are  presented  for  operations  count 
purposes  and  are  not  inLcnded  to  be  compilable  code. 

In  the  following  discussion,  sample  covariance  matrix  will  be  abbreviated 
to  SCM.  MR  and  MI  will  be  arrays  containing,  respectively,  the  real  and  ima- 
ginary parts  of  the  SCM,  and  will  be  floating-point  (FP).  SR  and  SI  are  FP 
arrays  containing  the  real  and  imaginary  parts  of  the  steering  vector  or  vec- 
tors. XR  and  XI  will  contain  the  real  and  imaginary  parts  of  the  sample  vol- 
tage vectors.  Preceding  each  algorithm  will  be  a list  of  variables  with  their 
types  (floating-point  (FP)  or  integer  (I)),  their  lengths  (as  a function  of 
the  number  of  weights  (N),  the  number  of  samples  (N^)  and  the  number  of  steer- 
ing vectors  (K)),  and  contents  (omitted  for  MR,  MI,  XR,  XI,  SR,  SI;  contents 
include  storage  scheme  (rowwise,  columnwise,  etc.)). 

FOR  loops  behave  as  they  should:  if  the  initial  value  is  greater  than 
the  final  value,  and  the  increment  (default  = 1)  is  positive,  or  if  the 
initial  value  is  less  than  the  final  value  and  the  increment  is  negative,  the 
loop  is  bypassed. 
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TABLE  OF  CONTENTS 


Title  Page 
SCM  Calculations  F-3 
Cholesky  without  Square  Roots  (LDL*)  Decomposition  F-5 
Cholesky  with  Square  Roots  (LL*)  Decomposition  F-7 
First  Back  Substitution  for  LDL*  F-9 
First  Back  Substitution  for  LL*  F-11 
Second  Back  Substitution  for  LDL*  F-13 
Second  Back  Substitution  for  LL*  F-15 
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SCM  Calculation 


Variable 

Type 

Length 

MR 

FP 

N(N+l)/2 

MI 

FP 

N(N+l)/2 

XR 

FP 

N 

XI 

FP 

N 

ROW 

I 

1 

START 

I 

1 

COLUMN 

I 

1 

Contents 

rowwise 

rowwise 

current  row  of  SCM 

location  of  current  element  of  SCM 

current  column  of  SCM 
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DEG  III 
STAET  - 1 


C UPDATE  EACH  ROVl 

Fon  now  - 1 TO  H 

nn(  START)  - iir.(  START)  + jiRi  nov;)''xn(  ROV7) 

+ . +j:i(  no',7)’f  :cT(  ROW) 

UPDATE  EACH  ELEMEUT  OF  CURREtlT  ROV? 

FOR  COEUJllJ  ■■  now  + 1 TO  11 
START  “ START  + 1 

MR',  START)  “ 11R(  START)  ■>  XR(  R017)  *XR(  COLUMN) 

+ • . +:a(  Rov;)  'eik  cotau -ii ) 

MIC  START)  " na  START)  + lEK  ROV?)  *XI(  COLUMN) 

+ . • -):i(  ROV))  *>:r(  columd) 

END  FOR 
END  FOR 

END 


Cholesky  without  Square  Roots  (LDL*)  Decomposition 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

rowwise 

MI 

FP 

N(N+l)/2 

rowwise 

TR 

FP 

N 

temporary  array 

TI 

FP 

N 

temporary  array 

RECIP 

FP 

N 

reciprocal  of  elements 
of  D in  decomposition 

M = LDL* 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

row  from  which  multiple 
of  ROWth  row  is  sub- 
tracted 

STARTR 

I 

1 

location  of  ROWth  ele- 
ment of  ROWth  row 
(diagonal  element) 

STARTS 

I 

1 

location  of  SUBROWth 
element  of  SUBROWth 
row 

STARTRS 

I 

1 

location  of  SUBROWth 
element  of  ROWth  row 

LENGTH 

I 

1 

length  of  ROWth  row 
starting  at  element 

ROW  + 1 

LENGTHS 

I 

1 

length  of  SUBROWth  row 
starting  at  element 

SUB ROW  + 1 

LOG 

I 

1 

starting  location  incre- 
ment used  to  point  to 
cun  ent  element  of  SCM 
being  calculated 

O vj 


Enoiti 
sTi\n'i’n  “ 1 
LEKG'i'U  - H 


C CALCULATE  nOVJTH  HOW  OF  L»  FACTOR  IH  II  - LDL* 

FOR  non  “ 1 TO  IJ 
C • ILAICE  UniT  DIAGOIIAL 

ni;ciP(nov7)  - i . /rim  gtartr) 

. LCKGTII  “ LEIIGTII  -1 

FOR  I.OC  " 1 TO  LEIIGTII 

Tp.i  Rovi+i.oc)  " nncipi  RciD’^nni  nTAP.TR+LOC) 

TI(  nOVI-i-LOC)  - r.ECIPI  r.GWI'TlK  GTARTR+LOC) 

EIID  FOR 

C * SUCTPJ\CT  MULTT.PLEn  OP  P.OIv'TII  ROV7  FRCIl  OTHER  ROWS 

LEIIGTIIG  “ I.EIIGTII  - 1 
r.TARVP.G  “ ETAP.TR  + 1 
UTAP.TG  " CTARTT:  + LEIIGTII  + .X 
FOR  EUDROVI  " EOV7  + 1 TO  H 

MRIDTARTD)  “ IIRI  ETAJ’.TG ) - TP.I  GUBP.O'.V)  * 

+ . . MP.I  ETARTRG)  - Tit  SUOROVI) " 

+ . . n'.CI  STARTPG) 

. . FOR  LOC  " X TO  LEIIGTIIfi 

. . . MR(  GTAP.TG  + I.OC)  '■  MR!  GTARTT  + LOC) 

+ . . . -TR!  f;'rnrQV7)’'  I!ni  nTAnTRG+LCC) 

+ . . . -VI(  EUi;i;OV7)' HTI  riVARTP.3>LOC) 

. . . nil  CTARTG >LQC)  ■■  M;.:!  CTAR  'E  t-LOG ) 

+ . . . -TRI  !It'nRO■l7)  ■••nI(fjT.^:>.TR'^^LOC) 

+ , . . +TT(  EUGROVD-’  IIRI  OTARTIIG+LCC) 

EIID  FOP. 

ST.ARTRS  " ETARTRE  + I 
ETARVE  " ETAIITE  :•  /.EIIGTIIE  + 1 
LEIIGTIIE  “ y, EIIGTIIE  -1 
. EIID  FOR 

C irOVE  IIORriALXCED  ROW  FROII  TP.  ADD  TX  TO  HR  AIIO'  MX 

FOR  LOC  " I TO  LEIIGTII 
. 7'.R' .CT.LRTR  .'  LOC)  ■'  TRI  .ETARTR  I -LOO  ) 

!!  •.(  .ETARTR  i-LOC)  " vyi  EVA.P.TR+LOG  > 

RED  FOP 

ETARTF.  " EVARTR  !■  '.EIICTn  X 
FOR 


Cholesky  with  Square  Roots  (LL*)  Decomposition 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

rowwise 

MI 

FP 

N(N+l)/2 

rowwise 

RECIP 

FP 

N 

reciprocals  of  diagonal 
elements  of  L in  de- 
composition M = LL* 

ROW 

I 

1 

current  row  of  SCM 

SUBROW 

I 

1 

Row  from  which  multiple 
of  ROWth  row  is  sub- 
tracted 

STARTR 

I 

1 

location  of  ROWth 
element  of  ROWth 
row  (diagonal  ele- 
ments) 

STARTS 

I 

1 

location  of  SUBROWth 
element  of  SUBROWth 

row 

STARTRS 

I 

1 

location  of  SUBROWth 
element  of  ROWth 
row 

LENGTH 

I 

1 

length  of  ROWth  row 
starting  at  element 
ROW  + 1 

LENGTHS 

I 

1 

length  of  SUBROWth 
row  starting  at 
element  SUBROW  + 1 

LOG 

I 

1 

starting  location  in- 
crement used  to  point 
to  current  element 
of  SCM  being  cal- 
culated 

F-7 


BEGIN 
STARTH  - 1 
LENGTH  - N 


BESFAVAlUftE  COPY 


C CALCULATE  EOVJTH  EOV?  OF  I.*  FACTOR  III  M " LL* 

FOR  now  " .1  TO  N 

nEciP(  r.ov;)  = i./bort  (nR(STARTR)) 

I-ERGl'n  “ LENGTH  -1 
FOR  I, GO  " \ TO  i.nnr/cn 

in;(  CTARTR  + LOC)  = RECIPt  ROW)  ■■■  r.r’.(  G?AP.7R>LOC) 
n:c(  CTARTR  + LOC)  " RECIPt  ROW)  •un(  GTARTR-r-LOC) 
END  FOR 


C 


SUBTRACT  IIULTIPLES  OP  ROWTII  ROW  PRCU  OTHER  ROWS 


i;iiO 


I.pnGTnS  " LENGTH  -.1 
STARTRS  " STARTR  + 1 
STARTS  ■■  STARTR  + LENGTH  + 1 
FOR  SUBROW  " ROW  + 1 TO  N 

MRl  STARTS)  “ I !■;(  STARTS  ) -I1R(  STAP.TRS  ) •' 

IIP.!  STT.nTRS  ) -;!X(  STARTRS  ) -LITK  STARTR3 ) 
FOR  l.OC  ^ 1 TO  y.ENGTIIS 
. . MRC  STARTS  : LOO  " )!!’{  SO.ARTS -^LOC) 

-!'.R(  STARTRS)  ••■•.'EH  STA.RTRS■^LOC) 

■ I !X(  STA’iTRS  > ■•'I  n(  STARTP.S  -i-LCC  ) 

MI(  STARTS -^LCC)  " STARTS-SLOC  ) 

. . -nR(  STARTR!!)  ••."■■;(  STRRRTRS -fLOC ) 

W!X{  STARTRS  )'■?!.!(  STARTR:;  ) 

RNU  FOR 

STAR'-’iiS  " STARTRS  + 1 
STAR'-'^S  " STAR'i’S  L1P3GTHS  .1 
. T.E‘.:STilS  " Li)L:GT;'.S  “.1 
END  LOP. 

start:’.  - STARTR  + .t.RNGTH  + X 
FOR 


JillO 


F-8 


First  Back  Substitution  for  LDL*  (LOT  » T) 


Var1 able 

Type 

Length 

Contents 

MR 

FP 

N(N+1)/2 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2 

Imaginary  part  of  L*  factor 
rowwl se 

SR 

FP 

KN 

real  part  of  steering  vectors 
vectorwise 

SI 

FP 

KN 

Imaginary  part  of  steering 
vectors  vectorwise 

RECIP 

FP 

N 

reciprocals  of  elements  of  0 
factor 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

location  preceding  first  lo- 
cation of  current  steering 
vector 

STARTSVA 

I 

1 

location  of  ROWth  element  of 
current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 
starting  at  element  ROW  + 1 

STARTR 

I 

1 

location  of  ROWth  element  of 
ROWth  row  of  SCM 

LOG 

I 

1 

location  increment  used  to 
address  current  element  of 
steering  vector  being  cal- 
culated 

F-9 


, BBT  AVAIIABIE  COPY 

C DO  EACH  STEEnilJG  VECTOR 


ron  AUG  " 1 TO  K 

C • SUDTHACT  IIULTIPLES  OP  EACH  ROV7  Op  H FROM  THE  STSERItJG  VECTORS 

STAPvTGVA  " STARTDV  + 1 
STAHTn  - 1 
LEHGTH  “ n-1 
FOR  nov;  “ 1 TO  n-.i 

FOR  LOC  ■■  1 TO  T.EnGTII 

snt  STARTGVA+LOC)  " DR!  STARTOVA+'.OC ) 

+ . . . -Dn(  STARTEVA)  - r!r:(  STARTR+LOC) 

+ . . . -;:i(  STAR’.'.’EVAj'^I’TC  STARTR+LOC) 

SI{  STARTGVA  ^ LOC)  “ C.Tf  ETARTSVA+LOC  ) 

+ . . . +f;R(  STAliyCVA)  ETAR'.OR  + .EOC) 

+ . . . -F.K  STAr.TSVA)>'Mn(  STAr;Ta  >LOC) 

i;nn  for 

r> TARTS VA  " G TARTS VA  + 1 
STARTR  “ STARTR  + LEEGTII  + 1 
J.EUGTII  " I.EUGTU  -1 
EIID  FOR 


C * MUETil’AT  UT'  RECIPROCATE  DIAGONALS 


EJIO 

EUD 


FOR  AOC  " i TO  n 

SIR  STARTSV+AOC) 
S l(  STiARTSVi-IiOC) 


SIR  r.TARTSV+AOC)  ’■'RECIPt  Aqc) 
SI(  SvARTDv'i-TeCO'ARECIPC  LOC) 


EIID  FOR 

STARTSV  “ STARTSV  + N 


FOR 


F-10 


First  Back  Substitution  for  LL*  (LT  = T) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

real  parts  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2 

imaginary  parts  of  L*  factor  row- 
wise 

SR 

FP 

KN 

real  part  of  steering  vectors 
vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering  vec- 
tors vectorwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  elements  of 
L factor 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

location  preceding  first  location 
of  current  steering  vector 

STARTSVA 

I 

1 

location  of  ROWth  element  of 
current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 
starting  at  element  ROW  + 1 

STARTR 

I 

1 

location  of  ROWth  element  of 

ROWth  row  of  SCM 

LOG 

I 

1 

location  increment  used  to  address 
current  element  of  steering  vec- 
tor being  calculated 

F-11 


DEGIU 

s TARTS V - a 


RfSI  AVAlUaE  COPY 


C DO  EACH  STEERING  VECTOR 

FOR  AUG  - 1 TO  K 

C * SUCTRACT  MULTIPLES  OF  EACH  ROW  OF  fl  FROM  THE  OT-JERIHG  VECTORS 

DTARTSVA  - DTARTSV  + 1 

STAnVU  “ 1 
I.EtlGTIl  " n-1 
FOR  row  ■>  1 TO  n 

sr.(  STAP.TSVA)  " rnCTPC  rOVn''ER(  STAHTSVA) 

SKETAPTSVA)  - RECIPf  HOW)  *CI(  STARTSVA) 

FOR  7,OC  " 1 TO  I.IU7KTn 

. . . Or.(  STARTSVA  +I,OC)  “ SW  STAP.TSVA+T.OC) 

+ . . , -Dp{  PTAF.TriVA)  • r:'.(  ri':’."'.PTP.  ►LOO 

♦ . . , STARTSVA)  ■'free  ;iv.UYT.’.+LQC) 

SI(  DTAP-TSVA  i LOO  ••  GI(  STARTSVA LOG ) 

+ . . . +Gr.(  STAr.?SVA)«HT(  STAnOR  + LCC) 

+ . . . -SI(  STARTSVA )’nnK  STAiri’I!  ►LOO 

EHD  FOR 

STARTSVA  » STARTEVA+l 
STiiRTR  “ STARTR  + LEKGTH  ♦ 1 
I.EnGTII  " LEKGTil-l 
Ena  FOR 

STARTS'/  " STARTGV  + U 
EHD  FOR 
EDO 


F-12 


Second  Back  Substitution  for  LDL*  (L*W  = T) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

real  part  of  L*  factor  rowwise 

MI 

FP 

N(N+l)/2 

imaginary  part  of  L*  factor 
rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 
vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering  vec- 
tors vectorwise 

DOTR 

FP 

1 

temporary  location 

DOT  I 

FP 

1 

temporary  location 

LAST 

I 

1 

location  of  last  element  of  SCM 

AUG 

I 

1 

current  steering  vector 

ROW 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

location  of  last  element  of  cur- 
rent steering  vector 

STARTR 

I 

1 

location  of  ROWth  element  of 
ROWth  row  of  SCM 

STARTSVS 

I 

1 

location  of  ROWth  element  of 
current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 
starting  at  element  ROW+1 

LOG 

I 

1 

location  increment  used  to  ad- 
dress current  element  of  SCM 
and  steering  vector 

F-13 


I3EGII1 

L7VGT  “ H»(N+l)/2 
CTAP-TCV  - H 


C ELIMINATE  EACH  CTEEniKG  VECTOH 


FOn  AUG  ” 1 TO  IC 
ETARTn  “ LAST 
STAUTGVS  “ STAHTSV-l 
I.ENCTH  “1 

FOR  nO!7  “ n-1  TO  1 STEP  -I 

CTARTR  " GTARTR  - LENGTH  -1 
DOTR  “ a. 

DOTI  “ a. 

FOR  LOG  " 1 TO  LENGTH 

DOTH  “ nOTR  + nn(  ETARTR+LOC)  ■'SRC  3TAP.TGV3+LOC) 

+ . . . -MK  GTARTRM.OO'^SiH  fJTARTSyS+LOC) 

DOTI  ■>  DOTI  + rin(  GTARTR  i-LOC) « 

. . SI(  ETARTSVG  + LOC)  + HI(  STARTR+LOC ) "DR(  STAr.TSVS  + LOC ) 

END  FOR 

SntSTARTSVD)  " GRC  OTARTSVS ) --  DOTR 
GI(  STARTGVS)  “ OKSTARTGVS)  - DOTI 
STARTaVS  - STzARTSVS-l 
LENGTH  » LENGTH  + 1 
END  FOR 

STARTSV"  aTARTSV+N 
END  FOR 
END 


F-14 


Second  Back  Substitution  for  LL*  (L*W  = T) 


Variable 

Type 

Length 

Contents 

MR 

FP 

N(N+l)/2 

real  part  of  L*  rowwise 

MI 

FP 

N(N+n/2 

imaginary  part  of  L*  rowwise 

SR 

FP 

KN 

real  part  of  steering  vectors 
vectorwise 

SI 

FP 

KN 

imaginary  part  of  steering 
vectors  vectorwise 

RECIP 

FP 

N 

reciprocals  of  diagonal  ele- 
ments of  L* 

DOTR 

FP 

1 

temporary  location 

DOT  I 

FP 

1 

temporary  location 

LAST 

I 

1 

location  of  last  element  of 
SCM 

AUG 

I 

1 

current  steering  vector 

ROM 

I 

1 

current  row  of  SCM 

STARTSV 

I 

1 

location  of  last  element  of 
current  steering  vector 

STARTR 

I 

1 

location  of  ROWth  element  of 
ROWth  row  of  SCM 

STARTSV S 

I 

1 

location  of  ROWth  element  of 
current  steering  vector 

LENGTH 

I 

1 

length  of  ROWth  row  of  SCM 
starting  at  element  ROW+1 

LOG 

I 

1 

location  increment  used  to 
address  current  element  of 
SCM  and  steering  vector 

F-15 


BBT  AVAIUBIE  COPY 


DEGIN 

LAST  “ N*(N+l)/2 
STAIITCV  - H 

C ELIMINATE  EACH  STEERING  VECTOR 

FOR  AUG  - 1 TO  K 
ETARTR  - LAST 
. ETAnTSVS  - DTARTSV-1 
LENGTH  - 1 

C • CALCULATE  N-TII  ELEMENT 

smsTARTsv)  - Gn(BrAnrsv)»rEciP(H> 

SKSTAUTSV)  - SI<  STARTSV)''RECIP(  N) 

C » CALCULATE  OTHER  ELEMENTS 

. FOR  R017  - N-1  TO  ,l  STEP  -1 

ETARTR  “ STARTR  - LENGTH  -1 
. . DOTR  “ a. 

DOT!  - a. 

FOR  LOC  - 1 TO  LENGTH 

DOTR  - DOTH  + MRT  ETARTR+LOC ) • 

+ . . . sr.(  riTAP.TSVD+LOC)  - MKSTAP.TR+LOO* 

♦ . . . BIT  STARTSVfJ  t-J.OC) 

DOTI  ” DOTI  + nP.(  r.TAR':'.’P.+LOC)'*SI(.'5TA?.T3V3+LOC)  ♦ 

♦ . . . riK  DTARTR+LOO'^SnC  STAr.TSVS+LOC) 

nrn)  for 

SIKnTARTDVS)  - ( SHC  DTART3VS ) -DOTH)  "RECIPC  ROVT) 
SKSTnHTSVD)  “ ( 3I(  ETAET3V3  ) -DOTI ) ’'r.SC.tPC  HOW) 

STARTSV3  “ ETARTSVS  -1 
I.ENGTH  “ LENGTH  + .1 

END  FOR  I 

STARTSV  " BTARTSV  ■>■  IJ  1 

END  FOR 
END 


Appendix  G.  CDC  7600  Software 


The  algorithms  and  implementations  of  these  programs  are  discussed 
in  Section  4.2.3.  Program  COVTST  and  subroutines  NEXDIM  and  SEKOND  are 
drivers  for  COVCMP,  which  calculates  the  sample  covariance  matrix.  DECTST, 
NEXDEC,  and  SEKOND  drive  CHOL,  which  performs  the  Cholesky  decomposition 
and  both  back  substitutions. 


G-1 


f 


COPY 


PROGRAM  COVTSTC  OUTPUT , TAPES -OUTPUT) 


2 

C 

TEST  COVARIANCE  CALCULATER  ON  CDC760O  MINNEAPOLIS 

3 

COMIION  /TIMING/  T2ER0 , TCVCMP ,V)ZERO  .VJCVCMP 

4 

COMI-IOII  /TIME/  ICSEC.IRSEC 

5 

COMIION/SSTORE/YR(  200)  ,YC(  200) 

6 

COMr-ION/II3FO/n,R(  20100)  ,C(  20100)  ,DR(  200)  ,BC(  200)  ,vm(  200)  ,WC(  ; 

7 

DATA  YR/200*! .0/,YC/200*1.0/ 

8 

WRITE  (6,100) 

9 

100 

FORMAT( lIIl,6X,9nDIMENSION,7X,8H2ERO  0UT,7X,8H1  SAMPLE, 5X, 

* 101I2N  SAMPLES,  10X,5HT0TAL) 

11 

N-0 

12 

999 

CONTINUE 

13 

C 

GET  NEXT  DIMENSION 

14 

N-NEXDIM(  N) 

15 

IF  ( N.EQ.0)  STOP  777 

16 

C 

TEST  ZERO  OUT 

17 

LEN-N*( N+l)/2 

18 

LOP-2000/N 

19 

CALL  SEC 

20 

IWl-IRSEC 

21 

ICl-ICSEC 

22 

DO  991  JL0P-1,L0P 

23 

DO  1 J-1,LEN 

24 

R(  J)-0. 

25 

C(  J)-0. 

26 

1 

CONTINUE 

27 

991 

CONTINUE 

28 

CALL  SEC 

29 

IW2-IRSEC 

30 

IC2-ICSEC 

31 

DO  992  JL0P-1,L0P 

32 

992 

CONTINUE 

33 

CALL  SEC 

31 

IC3-IC2-IC1 

35 

IC4-ICSEC-IC2 

36 

IC5-IC3-IC4 

37 

IW3-IW2-IW1 

38 

IW4-IRSrC-IW2 

39 

IV)5»IW3'IW4 

40 

TZERrf"FLOAT(  ICS)  *27 . 5E-6/PLOAT(  LOP) 

41 

W2ERO-  FLOAT*  1(75  )*27.5E-6  / PLOAT{  LOP  ) 

42 

C 

TEST  COVCUP 

■13 

CALL  COVCMP 

44 

W2N-2*H*V.r,VCMP 

45 

VJTOT-\7ZEr.O+W2N 

46 

T2N-2*!1*TCVCMP 

47 

TTOT~TZEnO+T2N 

48 

WRITE  ( C,101)  H,TZEr.O,TCVCMP,T2N,TTOT,17ZESO,V7CVCHP,W2N,V;TOT 

49 

101 

FORMATC  lI!0,I15,4F15.3,5X,13ItCPU(MILLISEC)  ,/,16X,4P15.3,5X, 
•14H(.’ALL(  HILLIOEC)  ) 

51 

GOTO  999 

52 

END 

G-2 


F 


bblavaiubie  copy 


53 

SUBROUTINE  COVCMP 

54 

COMMON  /TIMING/  TSEBO .TCVCMP .TOERO , WCVCMP 

55 

COlUlOtl  /TIME/  ICGEC,inSEC 

56 

coiuion/ssTop.E/vm  200)  ,yc(  200) 

57 

COMMON/ INFO/N,R(  20100) ,C(  20100) ,BR( 100) ,BC( 200) ,WR(  200) ,WC(  200) 

58 

LOP-1000/N 

59 

CALL  SEC 

60 

IWl-lRSEC 

61 

ICl-ICSEC 

62 

DO  991  JLOP-l,LOP 

63 

JMl-0 

64 

R(  1)-YR(  1)*YR(  1)+YC(  1)*YC(  1) 

65 

INDEX-1 

66 

DO  1 J-2,N 

67 

JMl-JMl+1 

68 

SR-YR(  J) 

69 

SC-YC(  J) 

70 

DO  2 K-1,JM1 

71 

R(  INDEX)-R< INDEX) +SR*YR(K)+SC*YC(K) 

72 

C( INDEX) -C(  INDEX) +SR«YCC  K)-SC*YR(  K) 

73 

INDEX- INDEX+l 

74 

2 

CONTINUE 

75 

R( INDEX )-R( INDEX) +SR»SH+SC*SC 

76 

INDEX- INDEX* 1 

77 

1 

CONTIITOE 

78 

991 

CONTINUE 

79 

CALL  SEC 

80 

IVJ2-IRSEC 

81 

IC2-ICSEC 

82 

DO  992  JLOP-l,LOP 

83 

992 

CONTINUE 

84 

CALL  SEC 

85 

IC3-IC2-IC1 

86 

IC4-ICSEC-IC2 

87 

IC5-IC3-IC4 

88 

IVr3-IV/2-IWl 

89 

IVM-IRSEC-IW2 

90 

IV)5-IV73-IW4 

91 

TCVCMP-FLOATC IC5) *27 . 5E-6/FLOAT(  LOP) 

92 

WCVCHP-FLOAT(  IW5 ) *2 7 . 5E-6 /FLOAT(  LOP) 

93 

RETURN 

94 

END 

G-3 


r 

'BBT 


AVAIUBLE 


95 

FUHCTIOIJ  IJEXD.T1«  N) 

96 

IF  (N.GT.O)  GOTO  1 

97 

NEXDIM-4 

98 

RETURN 

99 

1 

IF  (N.GE.30)  GOTO  2 

100 

NEXDIM-H+1 

101 

RETURN 

102 

2 

IP  ( N.GE.50)  GOTO  3 

103 

NEXDIM-N+5 

104 

RETURN 

105 

3 

IF  (N.GE.200)  GOTO  4 

106 

HEXDIM-N+10 

107 

RETURN 

108 

4 

KnXDIM-0 

109 

RETURN 

110 

END 

111 

IDEHT 

SEKOND 

112 

USE 

/TIME/ 

113 

ICSEC 

ESS 

1 

114 

IRS  EC 

ESS 

1 

115 

USE 

* 

116 

ENTRY 

SEC 

117 

SEC 

BSS 

1 

118 

RTIME 

IRSEC,7CLK 

119 

TIME 

ICSEC,  7CI.K,  USER 

120 

JP 

SEC 

121 

END 

G-4 


BESI.  AVAIU^E  COPY 


1 PROGRAM  DECTST(  OUTPUT, TAPES -OUTPUT) 

2 C TEST  CHOLSESKV  DECOMPOSITION  AND  BACK  SUBSTITUTIONS  ON 

3 C CDC76C0  MINNEAPOLIS 


4 

5 

6 

7 

8 
9 

11 

12 


COWlON  /TIME/  ICSEC.IRSEC 
COMMON/SSTORE/YRC  200) ,YC(  200) 

COMriON/irFO/N,R(  20100)  ,C(  20100)  ,Bn(  200)  ,BC(  200)  ,V7R(  200)  ,WC(  200) 
COMtlON  /TIMING/  TDECMP , TBKSBl , TBKS32  , WDECMP  .VTBKSBl  ,WBKSB2 
WRITE  (6,100) 

100  FORMATC  nil , 6X . 9HDIMENSION , 9X , 6HDECOMP , 5X , 10HBACK  SUB  1 , 
•5X,10nDACK  SUB  2 , 10X , 5HTOTAL) 

N-4 

1 CONTINUE 


13  C GET  NEXT  DIMENSION 


14  N-NEXDEC(N) 

15  IP  (N.EQ.0)  STOP  777 


16  C FILL  UP  MATRIX  WITH  POSITIVE  DEFINITE  HBRMITIAN  TEST  VALUES 


17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 


LEN-K*(  N+l)/2 
DO  2 J-1,LEN 
R(  J)-l. 

C( J)-l. 

2 CONTINUE 
INDEX-1 
DUM-10.»N 
DO  3 J-1,N 
R( INDEX )-DUM 
C( INDEX ) -0 . 
INDEX-IHDEX+J+1 

3 CONTINUE 
DO  4 J-1,H 
BR(  J)-.\. 

BC( J)-l. 

4 CONTINUE 


33  C USE  CHOLESKY  ROUTINE 


34 


CALL  CIIOL 


35 


C WRITE  (6,102)  ( V7R(  J)  ,V)C(  J)  , J-1  ,N) 


36 

37 
30 

39 

40 

42 

43 


102  FORMATC  On0(;EIG!ITS  , / , ( IX , 2E20. 10)  ) 

TMAX - TDnCMP>  TBKSB 1 + T3KSB2 
WMAX  -V7DECMP +V7QKSB 1 +(781(382 

WRITEC  C , 101 ) N , TDECMP , TBICOSl , TBICSEO  , TMAX , WDECMP  ,(7BKSB1  ,VBKSB2  ,(v71AX 
101  FORM.\T(  1H0 , 1 1 S , 4 FI  5 . 3 , 5X , 1 3nCPU(  MILLI3EC ),/,16X,4F15.3,5X, 
*14HWALL(  MILLISEC) ) 

GOTO  1 
END 
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44 

45 

46 

47 
4!] 

49 

50 

51 


simnouTiKi:  ci:ot^ 

CC!1*10N  /TI!*I!;G/  TOnCItP , TBKSDl , TBKS.IZ  .VrOBC!!?  ,TO:CC31  ,KBKSB2 
COKlOIl  /TTllt;/  ICCKC , IHSEC 

coM”.o!i/irro/!’,r.(  loioo)  ,c(  20100)  ,br(  2.00)  ,bc(  200)  ,vm(  200)  ,v.’c(  200) 
coi'j!oii/Br/i'onE/Yn(  200)  ,vc(  200) 

DIMElJSIOri  UR(  20100)  ,UC(  20100) 

DIMEKSIOI)  nECIP(  200) 

EQUIVALENCE  ( UW  1 ) , R(  1 ) ) , ( UC(  1 ) ,C(  1 ) ) 


52 

53 

54 

55 

56 

57 
53 

59 

60 
61 
62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 
7 6 


0 **A***ft****«**A***A**««*A***ftA*4*A***AftA1ki%jkA******«t*AA************«** 

C 

C THIS  r.OUTIKE  SOLVES  A SYSTE?!  OF  SIIIULT.YKEOUS  EQUATIONS  OVER  THE 
C COMPLEX  NUMBERS.  THE  COEFFICIENT  ILATP.IX  MUST  BE  IIEP-MITIAN  AND 
C POSITIVE  DEFINITE.  THE  CHOLESKY  FACTORIZATION  METHOD  IS  USED,  AND  THEN 
C BACK  SUBSTITUTION  {TV/ICE). 

C 

C VARIABLES  . . . 

C N DIMENSION  OF  PROBLEM,  I.E.  MATRIX  MUST  BE  N BY  H. 

C R REAL  PART  OF  INPUT  MATRIX.  ONLY  UPPER  TRl.ANSULAR  PART  OP 

C MATRIX  IS  STORED  ( COT,U!lIA;iSE ) , SO  DI'IENSION  - N*(N*l)/2. 

C C IMAGINARY  PART  OF  INPUT  MJITRIX,  ALSO  STORED  COLUMNWISE. 

C DIMENSION  - H*<N+l)/2 

C BR  REAL  PART  OP  CONSTANT  VECTOR,  I.E.  HEAL  PART  OF  B IN  THE 
C EQUATION  AW”3.  DIMENSIOI)  - N. 

C BC  IMAGINARY  PART  OF  CONSTANT  VECTOR,  ALSO  OP  DIMENSION  N. 

C VJR  REA7,  PART  OF  SOLUTION  VECTOR,  I.E.  REAL  P.AHT  OF  V7  IN  THE 

C EQUATION  AW-B.  DIMENSION  - H. 

C V;C  IMAGINARY  PART  OF  SOLUTION  VECTOR,  .ALSO  OF  DIMENSION  N. 

C UR  REAL  PART  OF  FACTORED  MATRIX.  DIMENSION  - N*(N+I)/2 

C UC  IM-AGINARY  PART  OF  FACTORED  MATRIX.  DIMENSION  - N»(N  + l)/2 

C YR  REAL  PART  OF  RESULTS  OF  FIRST  BACK  SUBSTITUTION.  DIMENSION  - N. 

C YC  IMAGINARY  PART  OP  FIRST  BACK  SUBSTITUTION  RESULTS.  DIMEKSUION  - N 

C 

Q *****«ft*****w1k*********1l(,1hA*1^***««***Jk**«*A«*******«*****tt**<t***1l:«** 


77 

78 

79 

80 
81 
82 


T(  J,K,1SUB,L0P)-FL0AT( J-K-ISUB) *2  7 . 5E-6/FLOAT(  LOP) 

LOP-1 

CALL  SEC 

IV/l-IRGEC 

ICl-ICSEC 

DO  991  JL0P-1,L0P 


83  C 

84  C NOTE.  . . 

85  C MATRIX  IS  ASSUMED  UPPER  TRIANGULAR 

36  C A(I,J),  IN  VECTOR  STOFJIGE , IS  A(J»(J-l)/2  + I) 

87  C 

88  C COMPUTE  FIRST  DIAGONAL  ELEMENT 

39  C 


90 

UR(  1) 

^ SORT(  R(  1 ) ) 

91 

UC(  1) 

- 0.0 

92 

n)iCIP( 

D-i./um  1) 

93 

INDEX 

- 1 

94  C 
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98 

99 
100 
101 
102 


10 


DO  10  J-1,KM1 
IIIDEX  - IHDEX 
Un( INDEX) 

UC( INDEX) 
CONTINUE 


♦ J 

R(  IHDEX) •RECIP( 1) 
C< INDEX )*RnCIP( 1) 


103 

104 

105 


C COMPUTE  DIAGONAL  ELEMENT 
C 


106 

107 

108 

109 

110 
111 
112 

113 

114 

115 

116 

117 

118 

119 

120 
121 
122 


INDEXI  - 1 
DO  40  1-2, NMl 
IIMl  - INDEXI 
ILOV7  - IIMl  ♦ 1 
INDEXI  - INDEXI  ♦ I 
lUP  - INDEXI  - 1 
IMl  - I - 1 
SUMDIA  • 0.0 
DO  20  K-ILOW,IUP 
R1  • UR(K) 

Cl  - UC(  K) 

SUMDIA  - SUMDIA  + B1*R1  ♦ C1*C1 
20  CONTINUE 

UR(  INDEXI)  - SQRT( R( INDEXI ) - SUMDIA) 
UC(  INDEXI)  - 0.0 
URDIV  - 1.0/UD(  INDEXI) 

RECIPC I) -URDIV 


123  C 

124  C COMPUTE  REST  OP  ROW 

125  C 


126 

JJMl  - IIMl 

127 

DO  30  J-I,NM1 

128 

JJMl  - JJHl  ♦ J 

129 

SUMR  - 0.0 

130 

SUMC  - 0.0 

131 

DO  25  K-1,IM1 

132 

R1  - Un(  IIMl+K) 

133 

Cl  - UC(IIM1+K) 

134 

R2  - Un(JJMH-K) 

135 

C2  - UC(JJM1+K) 

136 

SUMR  - SUMR  + Rl*n2  + Ci»C2 

137 

sut:c  - EUIiC  + R1*C2  - C1'''R2 

138 

25 

CONTINUE 

139 

IJ  - JJMl  + I 

140 

un(lj)  - URDIV*(n(IJ)  - SUMR) 

141 

UC(IJ)  - UnDIV*(C(IJ)  - SUMO 

142 

30 

CONTINUE 

143 

40 

CONTINUE 

144  C 
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145  C DO  LAST  ELEMENT 

146  C 


147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

160 
161 
162 
163 


iLow  - rnonxi  + i 
lUP  - INDEXI  ♦ KMl 
suMDiA  - a. a 
DO  50  K-ILOVMUP 
ni  - UR(K) 

Cl  - UC(K) 

SUMDIA  - SUMDIA  + ni*Rl  ♦ C1*C1 
50  CONTINUE 

INDEXI  • INDEXI  ♦ N 

UR(IIIDEXI)  - SORT(  R(  INDEXI)  - SUMDIA) 
UC(  INDEXI)  - 0.0 
RECIP(N)"1./UR(  INDEXI) 

991  CONTINUE 
CALL  SEC 
1V;2-IRSEC 
IC2-ICSEC 
DO  992  JLOP-l,LOP 


164  C 

165  C FIRST  BACK  SUBSTITUTION,  USING  CONJUGATE  TRANSPOSE 

166  C 


167 

168 

169 

170 

171 

172 

173 

174 

175 

176 

177 

178 

179 

180 
181 
132 

183 

184 

185 

186 

187 

188 

189 

190 

191 

192 

193 


DIV-REC1P( 1) 

YRt  1)  - BR(  1)*DIV 
YC( 1)  - BC< 1)»DIV 
INDEX  - 1 
ILOW  - 0 
IHIGH  - 0 
DO  70  1-2, N 

ILOV)  - HUGH  ♦ 2 
HUGH  - IHIGH  + I 
SUMR  - 0.0 
SUMC  - 0.0 
K - 0 

DO  60  J-ILOW, IHIGH 
K - K + 1 

SUMP.  - SUMR  + Yn(K)*UP(J)  + yC(K)*UC(J) 
SUMC  - SUMC  - YR(K)*UC(J)  + YC(K)»US(J) 
60  CONTINUE 

INDEX  - INDEX  ♦ I 
DTV-RECIP(  1) 

Yn(i)  - (BR{i)  - suMn)''aiv 
YC(I)  - (DC(I)  - SUMO^DIV 
70  CONTINUE 
992  CONTINUE 
CALL  SEC 
IV?3«inSEC 
ICS-ICSEC 
DO  993  JLOP“l,LOP 


194  C 

195  C SECOND  BACK  SUBSTITUTION 

196  C 
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Iff  » F 

kmM'  tUilm  1 


fmv 


197 

198 

199 

200 
201 
202 

203 

204 

205 

206 

207 

208 

209 

210 
211 
212 

213 

214 

215 

216 

217 

218 

219 

220 
221 
222 

223 

224 

225 

226 

227 

228 

229 

230 

231 

232 


90 


993 


994 


M - N 

I - ll*(H+l)/2 
DIV-RECIP(  M) 

VJn(M)  - YR(M)«DIV 
WC(M)  - yC(M)*DIV 
TR  - WR(M) 

TC  - WC(M) 

M - M - 1 
DO  90  J-1,M 

K - M - J 

L - I - J 

YR(K  + 1)  - YR(K  + 1)  - UR(L)»TR  ♦ UC(I.)*TC 

YC(K+1)  - YC(K+1)  - UR(L)»TC  - UC(L)*TR 
CONTINUE 
I - I - M - 1 
IP( I. HE. 1) GOTO  80 
DIV-RECIP(  1) 

WR(  1)  - YR(  1)*DIV 

WC(  1)  - YC( 1)*DIV 

CONTINUE 

CALL  SEC 

IW4-IRSEC 

IC4-ICSEC 

DO  994  JL0P-1,L0P 

CONTINUE 

CALL  SEC 

IW-inSEC-IW4 

IC-ICSEC-IC4 

TDECMP-T( IC2,ICl,IC,LOP) 

VroECMP-T(  IW2  , IV71 , IW,LOP) 

TBKSni-T(  IC3,IC2,IC,LOP) 

VVBKCB1'’T(  IW3  , IVf2  , IV/.LOP) 

TBKSB2-T( IC4,IC3,IC,LOP) 

VroKSB2-T(  IW4  , IV73  , IV/.LOP) 

RETURN 

END 
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233 

FUNCTION  NEXDEC(N) 

234 

IF  (N.GE.20)  GOTO  1 

235 

NEXDEC-20 

236 

RETURN 

237 

1 

IF  (N.GE.2O0)  GOTO  2 

238 

NnXDEC-N+20 

239 

RETURN 

240 

2 

NEXDEC-0 

241 

RETURN 

242 

END 

243 

I DENT 

SEKONB 

244 

USE 

/TIME/ 

245 

ICSEC 

BSS 

1 

246 

IRS  EC 

ESS 

1 

247 

USE 

* 

248 

ENTRY 

SEC 

249 

SEC 

BSS 

1 

2 50 

RTIflE 

inSEC,7CLK 

251 

TIME 

ICSEC, 7CLK, USER 

252 

JP 

SEC 

253 

END 
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Appendiit  H.  A Necessary  Condition  for  the  Nonsingularity  of  a 
Sample  Covariance  Matrix 


Given  a collection  of  m complex  n-dimensional  sample  vectors, 

their  sample  covariance  matrix  is  defined  by 

M = 5 = y (1) 

“ k=l  “ k= 1 “ 

where  is  the  conjugate  transpose  of  so  that  the  ij^^  component 

of  M^*^^  is 

^(k)  ^ 5(k)  ^(k)  ^2) 

1 J 

We  claim  that  M is  singular  if  m < n,  i.e.,  if  the  number  of  sample  vectors 
is  less  than  their  dimension. 

Proof;  First  we  need  a definition  and  a lemma. 

Definition:  The  rank  of  a matrix  ^ is  the  dimension  of  the  space  spanned  by 
its  columns. 

Note  that  a square  matrix  A is  nonsingular  if  and  only  if  its  dimension  equals 
its  rank,  rank(A).  This  is  because  ^ is  nonsingular  if  and  only  if  its  columns 
(or  rows)  are  independent,  i.e.,  l columns  span  a space  of  dimension  Z for  any 
subset  of  Z columns. 
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Lenna: 


Rank  (A  + ^)  < rank  (A)  + rank  (B).  » 

Proof:  Given  a set  of  vectors  which  span  a space  of  dimension  n.  and  a 

set^  of  vectors  which  span  a space  of  dimension  nj^,  the  set  of 

vectors  QJjS  clearly  spans  a space  of  dimension  < n + n.  , so  any 

set  S of  linear  combinations  of  vectors  inCl'U'S-  spans  a space  of 

dimension  < n,  + n.  . Now  ^et(Ji=  {columns  of  A},S  = (columns 
a D — 

of  , and  S = {columns  of  A + B^}  and  the  result  follows.  Q.E.D. 

Corollary:  Rank  ( T.  < y rank  • 

'k=l  ~ ' ~ k=l 

Proof:  Follows  immediately  from  the  lemma  by  induction.  Q.E.D. 

Now  we  may  continue  with  the  main  proof. 

'nr*  thi»  colunns  of  are  all  multiples  of  the  vector  by  Eq.  (2), 

(kl 

by  the  columns  of  M'  ' so  that  space 
could  be  0 if  all  the  components  of 


»•« 'or  spans  the  space  spanned 

»»«■  -ana  ' (the  rank 


NOTE:  This  proof  presents  only  a necessary  condition  for  the  nonsingularity 
of  the  sample  covariance  matrix,  not  a sufficient  one.  In  other 
words, the  matrix  may  still  be  singular  even  if  the  number  of 
samples  exceeds  the  dimension.  The  interested  reader  is  referred 
to  "Rapid  Convergence  Rate  in  Adaptive  Arrays"  by  Reed,  Mallett 
and  Brennant  for  a sufficient  condition  on  the  number  of  samples 
for  good  results. 


Appendix  I.  Number  of  Bits  Needed  for  Sample  Covariance 
Matrix  Calculation 

This  appendix  deals  with  the  lower  bound  on  computational  complexity 
of  forming  the  sample  covariance  matrix  and  the  accuracy  to  which  it  can 
be  computed.  It  also  includes  a section  on  how  these  bounds  are  related 
and  may  be  changed  due  to  machine  limitations. 

The  i^^  sample  covariance  matrix  can  be  defined  as 

1 -i*  i i-1 

= X’  X’  + m’  ' i = 1,2,... 

where 

x’  is  the  i^^  input  voltage  sample  vector  (i.e.,  row  vector) 

is  the  conjugate  transpose  of  X^ 
m’  is  the  covariance  matrix 
M°  elements  all  have  value  of  zero. 

We  will  also  define  the  following: 

N is  the  number  of  weights  in  the  system;  this  is  equal  to  the 
number  of  elements  in  X. 

S is  the  number  of  samples  used  to  form  M.  It  has  been  shown 
that  by  setting  S = 2*N,  the  weights  will  be  computed  within 
3 dB  of  optimal.* 

A.  COMPUTATIONAL  COMPLEXITY 

Since  M is  Hcrniti.in,  only  tlio  lower  or  upper  triangular  region  needs  to  be 
computed.  We  will  assume  that  the  upper  triangular  region  is  being  computed. 

* 

, ••I  'll  ■eive*' 'aemi  V ••>• 

I Ill  ».  - . ••>-*  • ••  » •'  ' 
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This  section  of  the  matrix  contains  N*{N+l)/2  elements.  Each  element 
of  the  vector  X is  only  multiplied  by  a conjugate  of  an  element  of  X to 
update  a location  in  M.  A program  for  a serial  machine  would  look  like 
the  following: 

DO  I = 1,N 

DO  J = 1 ,N  * 

M{I,0)  = CONJUGATE  (X{I))x(J)+  M{I,J) 

END  DO 
END  DO 

In  actual  practice,  only  N(N+l)/2  multiplies  must  be  computed,  with 

N(N+l)/2  adds  needed  to  update  M.  A program  for  a serial  machine  would  now 

look  like  the  following,  assuming  that  the  elements  of  M are  stored  column-wise: 

INX  = 0 
DO  I = 1,N 

INX  = INX  + I 
ONX  = INX 

DO  J = I,N  „ 

M(JNX)  = CONJUGATE  (X(I))  X(J)  + M(JNX) 

JNX  = JNX  + J 
END  DO 
END  DO 


B.  COMPUTATIONAL  ACCURACY 


Accuracy  will  be  expressed  in  number  of  bits.  To  compute  the  number  of 
bits  needed,  two  quantities  must  be  known:  1)  the  sample  size,  ^ and 
2)  the  number  of  bits  from  the  analog  to  digital  converter  (ADC),  B. 

Whenever  a binary  add  is  performed  on  two  words  of  a and  b bits  long, 
the  resultant  sum  can  require  MAX  (a,b)+l  bits.  Whenever  a binary  multiply 
is  performed  on  two  words  of  a and  b bits  long,  the  resultant  product  can 
require  a^b  bits. 


Elements  of  X are  expressed  as  a real  and  Imaginary  part,  each  B bits 

long.  Performing  multiplications  will  require  2 8 bits.  Each  complex  multiply 

* 

provides  an  add,  so  each  individual  term  of  X X will  require  p*B  + l)bits. 

These  values  must  be  added  to  the  previous  computed  values  S-1  times. 

In  vector  notation  the  formula  is 


Therefore  the  total  number  of  bits  required  is  [1062  S*(2^*®^^-1 )+l  1 
where  fXl  means  the  smallest  integer  greater  than  or  equal  to  X (the 
ceiling  function).  This  formula  is  derived  as  follows:  The  maximum  number 
which  can  be  expressed  in  k bits  is  2^-1.  If  we  are  to  sum  values  of  b bits, 
the  worst  case  is  2*’-l . 

Let 

k be  no.  of  bits  required  for  answer 
b be  no.  of  bits  in  items  to  be  summed 
S be  no.  of  items  to  be  summed 
k,b,  & s all  integers,  c,b,s>0 
S*(2’^-l)  < 2'^-l 
Solve  for  k,  as  follows: 

S*(2^-l)  + 1 < 2*^ 

L0G2(S*(2^-1)  + 1)  < k 

rL0G2(S*(2^-l ) + 1)1  = k because  all  variables  are  integers. 


Solve  for  S 


To  choose  some  numbers  for  examining  this  function  let  B = 12  bits  and 
S = 2*N,  when  N “ 200. 

fL0G2  S*(2^*®^’-])+ll 
fLOG^  400*(2^^-l)+ll 

34  bits  required. 

Table  I-l  lists  different  computers  and  their  mantissa  length  in  bits.  The 
table  also  lists  the  maximum  N allowed  with  S * 2*N. 

C.  WHAT  TO  DO  IF  MACHINE'S  ACCURACY  IS  INSUFFICIENT 

There  are  a couple  of  approaches  to  this  problem.  Before  any  program  is 
discussed,  however,  a deeper  understanding  of  the  calculations  in  Section  B is 
requi red. 

In  Section  B,  we  calculated  the  number  of  bits  needed  to  ensure  that 
no  errors  are  introduced  into  the  calculations;  all  values  were  treated 
as  fixed  point.  Almost  all  advanced  computers  have  floating-point  hardware 
on  them  and,  in  fact,  this  hardware  will  be  used  during  the  inversion  process 
to  minimize  errors  introduced  by  division. 


METHOD  1 

One  solution  to  the  accuracy  problem  is  to  ignore  it  and  let  the  floating- 
point hardware  take  care  of  it.  There  is  one  problem  with  this  approach, 
which  only  occurs  when  the  number  of  samples  is  very  large.  All  multiplies 
are  on  similar-magnitude  numbers  (namely  B bits)  and  all  adds  to  perform 
complex  multiplies  are  on  similar-magnitude  numbers  (namely  2*B  bits).  After 
we  have  added  up  a large  number  of  samples,  the  magnitude  of  the  values  for 

M . . may  be  much  larger  than  (2*B+1)  bits.  In  that  case,  the  addition  may  only 
★ 

add  in  part  of  this  product,  or  v;orse,  none  of  it. 

As  an  example,  assume  that  the  mantissa  is  30  bits  long  with  exponent, 
and  B » 12  bits  as  before. 

How  many  samples  can  be  added  before  we  lose  5 bits  of  a product  term? 
That  is  the  Sfime  as  asking , How  many  samples  can  we  accumulate  In  35  bits? 

By  using  the  formula  from  Section  B we  have 


S = 2^°  = 1024  samples  . 

Another  question  is.  How  many  samples  can  be  summed  before  an  error  is  intro- 
duced ? Again  with  30-bit  mantissa,  we  apply  the  same  formula: 
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s ■ [j2*b;1  J 


s = 


S = 2^  = 32  samples 

We  can  also  compute  when  any  additional  samples  will  be  simply  ignored,  i.e., 
will  not  change  the  running  sum  at  all.  Again  with  the  same  assumptions  as 
before 

Total  no.  of  bits  required  « 30  + 25  » 55 


S = 2^°  + 2^  samples  , 

which  is  indeed  a large  number,  but  by  the  time  we  have  reached  this  limit, 
the  cumulative  error  is  very  large. 

The  inversion  process  introduces  other  errors,  though  some  errors  in  the 
covariance  matrix  may  be  acceptable  due  to  the  accuracy  requirements  of  the 
weights.  Whether  the  existing  floating-point  hardware  is  sufficient  or  not 
will  depend  upon  the  particular  system  specifications. 

METHOD  2 

There  is  another  technique  to  limit  the  size  of  the  numbers.  The  sample 
covariance  matrix  represents  the  expected  value  between  the  i^^  and 
component  of  the  signal. 
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The  expected  value  is  approximated  by  averaging  a set  of  samples  together. 


This  can  be  represented  as 


M,,  = F 


ij 


k=l 


The  division  has  the  effect  of  reducing  the  number  of  bits  required.  Instead 
of  computing  the  entire  sum,  partial  sums  can  be  computed  and  averaged.  The 
computation  is  now 


ms  + s 


L 

k=n6+l 


By  choosing  s appropriately,  the  total  number  of  bits  required  can  be  reduced. 
Again, choosing  s is  based  upon  the  particular  system  (ADC,  computer,  accuracy 
of  weights,  etc.  )i  By  doing  this  calculation  we  add  -^^^^divisions  to 
compute  M. 

It  should  be  noted  that  if  M is  produced  by  the  method  of  Section  B,  it 
may  be  advisable  to  divide  each  element  by  S to  reduce  the  number  of  bits 
required.  This  is  allowable  because  it  only  changes  all  the  weights  by  a 
constant  factor,  and  due  to  the  weighting  processor,  multiplicative  constant 
factors  have  no  effect  upon  choosing  optimal  weights,  i.e.,  for  given  optimal 
weights,  W,  all  constant  multiples  are  also  optimal. 

It  should  also  be  noted  that  for  all  divisions,  the  divisors  are  real 
numbers. 

Let  us  run  through  an  example  using  averaging.  Assume -mantissa 
length  is  30  bits,  B = 12.  What  is  the  maxi  taim  S and  s that  can  be  used? 


1-7 


s = 2®  = 32 

Each  division  by  s reduces  the  product  back  to  30  - 5 = 25  bits.  We  now 
must  compute  how  many  sums  of  25-bit  numbers  are  needed  to  reach  the  30- bit 
limi t. 

No.  of  sums  * 

No.  of  sums  = 32 
S = No.  of  sums  * s 
32  * 32  = 1024 


If  we  assume  S = 2*N,  then  this  method  with  the  number  of  bits  specified 
will  handle  a system  of  512  weights. 

METHOD  3 

Methods  1 and  2 can  of  course  be  combined.  The  main  problem  with 
Method  1 is  the  errors  introduced  by  adding  numbers  of  greatly  differing 
magnitudes.  This  can  be  overcome  by  calculating  M in  the  fsllowing  manner. 


ms+s 


£ 

k’-msfi 


1 J 
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By  calculating  sums  of  s terms  and  adding  them  together,  we  will  be  adding 
numbers  of  similar  magnitude.  In  the  extreme,  the  additions  will  be  per- 
formed in  a tree-like  manner  as  shown  in  the  following  example. 

Sums  of  samples  1 to  s,b  bits 

b+1  bi 

s+1  to  2s 
2s+l  to  3s 

+]>>  b+1  bits 

3s+l  to  4s 

Assuming  only  b bits  in  the  mantissa,  this  procedure  minimizes  errors. 

At  a minimum  this  method,  as  well  as  Method  2,  requires  N(N+l)/2  more 
storage  locations. 

METHOD  4 

In  this  method,  the  entire  covariance  matrix  calculation  is  done  in  fixed- 
point  math.  On  the  machines  we  are  concerned  with,  fixed-point  is  implemented 
as  integer  math. 

On  machines  such  as  the  PEPE  and  STARAN  which  have  large  bit  length 
integers  (PEPE  can  use  46  bits  and  STARAN  has  variable  length),  this  method 
seems  better  than  losing  accuracy  by  using  a short  floating-point  format. 

The  STARAN  is  unique  among  the  computers  being  studied  in  that  there  is 
no  fixed  word  length  for  numbers.  A STARAN  word  can  be  divided  into  fields  of 
any  number  of  bits.  The  STARAN  word  length  is  256  bits  on  the  old  model  and 
9K  on  the  new  model.  The  STARAN  performs  integer  instructions  in  bit  serial 
manner,  so  that  ADD  and  MULTIPLY  times  are  directly  a function  of  the  number 
of  bits.  To  properly  use  this  machine,  the  bounds  on  bit  length  will  be 
used  to  minimize  running  time. 


I-P 


D.  SUMMARY 

We  notice  that  the  general-  purpose  scientific  machines  all  have  sufficient 
bits  to  handle  the  covariance  matrix  calculation.  We  will  take  advantage  of 
this  by  performing  the  detail  simulations  on  the  7600  and  only  using  the 
supercomputers  to  obtain  timing  estimates  and  see  the  suitability  of  their 
architecture. 

The  two  exceptions  to  the  above  are  the  STARAN  and  PEPE.  In  these  two 
cases  the  calculations  will  be  done  in  integer  and  then  converted  to  floating- 
point for  the  inversion  process.  These  integer  calculations  can  be  simulated 
on  the  7600,  so  again  these  machines  only  need  to  be  examined  for  speed  con- 
siderations and  the  applicability  of  their  hardware  architecture  to  the 
problem. 

One  item  of  research  is  to  compute  what  the  loss  is  on  the  24-  and  23-bit 
mantissa  machine.  The  reasons  for  using  the  smaller  mantissa  length  is 
speed,  but  architecture  considerations  will  not  change. 
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Table  I-l.  Attainable  Accuracy  for  Covariance  Matrix  (ADC  is  12  Bits,  Sample 
Size  = 2 X No.  of  Weights). 


COMPUTER 

NO.  OF  BITS 

IN  FLOATING-POINT 

MAXIMUM  NO. 

MANTISSA 

OF  WEIGHTS 

CDC  7600 

SINGLE  PRECISION 

48 

222 

DOUBLE  PRECISION 

96 

2^0  + 2^^  + : 

CDC  STAR- 100 

HALF  PRECISION 

23 

TOO  SMALL  -0 

SINGLE  PRECISION 

47 

2^1 

CRAY-I 

SINGLE  PRECISION 

48 

222 

TI  ASC 

SINGLE  PRECISION 

24 

TOO  SMALL  -0 

DOUBLE  PRECISION 

88 

262  , 237  ^ ; 

ILLIAC  IV 

SINGLE  PRECISION 

24 

TOO  SMALL  -0 

DOUBLE  PRECISION 

48 

222 

PEPE 

SINGLE  PRECISION 

23 

TOO  SMALL  -0 

STARAN 

VARIABLE  PRECISION 

N/A 

INTEGER 

I-ll 


Appendix  J.  Parallel  Implementations  of  Computing 
the  Sample  Covariance  Matrix 


We  have  already  outlined  the  computations  needed  to  compute  the  sample 
covariance  matrix.  We  will  now  discuss  methods  for  computing  it.  We  will 
first  show  a parallel  special-purpose  hardware  approach.  We  will  use  this 
as  a model  to  develop  approaches  for  parallel  and  vector  machines. 

Let 

be  the  i^^  voltage  vector 
be  the  I part  of  the  i^^  element  of  X 
X.p  be  the  Q part  of  the  i^^  element  of  X 
be  the  i^^  sample  covariance  matrix. 

Then 

X^  + where  M°  contains  all  zero  elements. 

This  can  be  rewritten  as 


'^ijl  “ ’^il 
”ijQ  “ ^iQ  ’'jl 


- x^  x^ 


+ M 


i-1 

ijl 


+ M 


i-1 

ijQ 


The  hardware  is  the  same  for  both  the  real  and  imaginary  parts  except 
for  an  adder/subtractor.  The  hardware  is  also  identical  for  every  ij  pair. 
This  commonality  will  be  exploited  in  terms  of  software  (‘esigns  and  could 
be  exploited  in  hardware  designs  by  having  common  replacement  modules. 
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The  following  circuitry  will  perform  these  operations: 


This  system  is  not  practical  for  a large-scale  problem:  e.g.,  if  we  have  a 
system  of  200  weights,  it  would  require  80,400  multipliers  and  80,400  adders. 
With  large-quantity  buys,  the  multiplier  chip  will  cost  approximately  $70. 

The  total  cost  of  just  the  multiply  chips  would  be  $5,628,000. 

This  system  will  compute  the  covariance  matrix  very  quickly  as  all 
multiplies  are  performed  in  parallel,  followed  by  only  two  stages  of  adding. 
Using  times  of  200  nsec  per  multiply  and  50  nsec  per  add,  the  total  time 
required  will  be  300  nsec  * no.  of  samples.  Assuming  a 200-weight  system, 
we  will  require  400  samples  (2*200)  to  obtain  a running  time  of  120  ysec 
with  a sample  every  200  nsec. 

Since  the  limiting  factor  in  terms  of  sampling  rate  is  the  multiplier 
rates,  the  system  can  be  modified  by  changing  multipliers  so  that  there 
are  sufficient  multipliers  for  each  ij  pair  to  deliver  results  to  the  adders 
at  their  rate.  This  can  be  illustrated  better  in  a timing  diagram.  We  will 
use  4 multipliers  of  200  nsec  and  one  adder  of  25  nsec.  This  system  will 
sample  at  a rate  of  one  sample  every  50  nsec.  It  should  be  noted  that  this 
system  requires  4 times  as  many  multipliers. 

To  put  the  sample  algorithm  on  a parallel  processor,  we  will  first 
assume  we  have  as  many  processors  as  required  and  each  processor  can  add, 
subtract  and  multiply.  Let  us  assume  we  have  one  processor  box  for  each 
multiplier.  The  processors  will  all  execute  the  same  code  in  lockstep, 
as  do  STARAN,  PEPE,  and  ILLIAC  IV. 


MULTIPLY  INPUT!, INPUT2 

We  have  completed  the  multiply  but  not  the  add  as  the  partial  results 
are  in  different  processors.  To  overcome  this  problem,  we  will  let  processors 
communicate  with  their  neighbor.  This  is  allowable  in  the  STARAN  and  ILLIAC 
IV  but  not  the  PEPE. 


CALCULATION 


Xi  * Xl 
il  jl 


* x^ 


x^  * 
iQ 


i 

JQ 


* X 


i+1 

jl 


xi+1  * X'*+l 
iQ  jQ 


xi'''^  * X^+l  _ xi+i  * X^'*’! 
II  jl  iQ  jQ 


* X^+2 

jl 


i+2  * x1+2 
iQ  jQ 


X^+2* 

il 

i+3 

il 

^ji 

i+3 

* X^+3 

jQ 

jQ 

* X 


i+2 

jQ 


xi+3  * xi+3  _ xi+3  * xi+3 
il  jl  iQ  jQ 


TIME  I 1 ' 

NSEC  0 200  400 
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The  reason  for  the  connection  between  P^g  and  P^j  is  for  uniformity 
in  the  design  even  though  the  connection  is  not  mathematically  required. 

The  program  is  now  as  follows: 

—^MULTIPLY  INPUT!,  INPUT2 

ADD  VALUE  FROM  RIGHT  PROCESSOR 

ADD  RUNNING  SUM 

STORE  RUNNING  SUM 

GOTO 

We  have  performed  these  operations  in  all  processors  so  we  have  done  extra 
work,  but  this  work  did  not  cost  us  any  time. 

The  system  is  still  not  realizable  on  existing  parallel  computers 
because  of  the  large  number  of  processors  required.  Assuming  a system 
of  200  weights,  we  would  require  80,400  processors. 

We  can  cut  the  number  of  processors  in  half  by  allowing  each  processor 
to  perform  all  the  computation  required  for  each  element  of  M. 


The  program  will  be  as  follows: 


MULTIPLY 

INPUT!, INPUT2 

STORE 

TEMPI 

MULTIPLY 

INPUTS, INPUT4 

ADD 

TEMPI 

ADD 

RUNNING  SUM 

STORE 

RUNNING  SUM 

L-GOTO 

This  procedure  does  not  require  any  processor-to-processor  communication, 
so  it  is  suitable  for  the  PEPE. 

Unfortunately,  the  number  of  processors  required  is  still  too  large 
for  existing  machines  for  the  problem  we  are  interested  in.  The  obvious 
way  of  fitting  this  problem  to  existing  machines  is  to  simply  partition 
the  program  into  subproblems  which  can  be  handled.  In  the  case  of  200 
weights,  we  have  20,100  complex  terms  to  compute.  If  we  have  N processors 
available,  we  will  compute  N complex  terms  at  a time  and  iterate  20100/N 
times. 

Another  approach  known  as  the  "global  method,''  which  is  based  upon 
the  STARAN  architecture,  is  as  follows: 
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•X 


Set  1 to  1 

Load  into  the  global  register 

Multiply  all  appropriate  elements  by  this  value 

Load  into  the  global  register 

Multiply  all  appropriate  elements  by  this  value 

Do  parallel  adds 

i -t.i  + 1 
If  (i<N) 

The  execution  time  of  this  algorithm  is  a function  of  N.  If  the  number 
of  processors,  M,  is  greater  than  N,  then  the  execution  time  is  proportional 
to  N.  If  N is  greater  than  M,  which  will  usually  be  the  case  for  large  M, 
we  will  do  M elements  at  a time.  The  execution  time  is  then  proportional 
to  the  number  of  groups  which  the  problem  must  be  devoted  to. 

For  a triangular  system, 

6 = (P+1 ) ( ^ N-P*M  ^ where 

f 'LSJ  • 

For  the  complete  matrix. 

Both  of  these  equations  reduce  to  N when  M^N.  This  means  that  in 
this  case,  the  entire  matrix  can  be  obtained  in  the  same  time  as  only  the 
triangular  matrix. 

— ■ ■ ■ ■ m 
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The  drawback  to  this  approach  is  that  not  all  parallel  processors  will 
be  active  at  all  times,  but  this  multiply  is  faster  than  the  multiply  which 
uses  two  values  in  the  same  word.  Me  would  like  a system  which  always  uses 
the  maximum  number  of  available  processors  so  that  execution  time  will  be 
minimal . 

This  leads  to  another  method,  to  be  known  as  the  " packed  method." 

We  will  illustrate  these  approaches  by  means  of  examples. 

Completely  Parallel  Computation  Approach 


Assume  the  number  of  processors  is  greater  than  the  number  of  weights. 

N = 3 no.  of  weights 
M = 4 no.  of  processors 

Pro- 


cessor 

A 

B 

c 

0 

1 

*11 

^IQ 

’^ll 

XlQ 

"ill 

"llJ 

^21 

^2Q 

X31 

3Q 

M 

231 

”23J 

2 

*11 

*.Q 

*21 

^2Q 

^12J 

•^120 

Si 

^ 3Q 

Si 

^3Q 

M 

"331 

^330 

3 

*11 

*iq 

*31 

^^131 

”l3J 

4 

*21 

*2« 

*21 

^2Q 

•^221 

”22J 

Computation  Group  1 
Time  Steps  1-8 

A*C  - B*D 
A*D  + B*C 


Computation  Group  2 
Time  Steps  9-16 
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r.  -..J 


The  upper  triangular  matrix  is  updated  in  16  time  steps.  We  notice  that 
in  the  second  group,  two  processors  are  available  for  work.  We  will 
now  rework  the  example;  using  3 processors. 


Pro- 

cessor 


1 

^11 

IQ 

”lll 

”llJ 

21 

2Q 

21 

2Q 

M221 

'^22J 

2 

11 

IQ 

21 

2Q 

^121 

”l2J 

21 

2Q 

31 

3Q 

^231 

^23J 

3 

11 

IQ 

31 

3Q 

^^131 

”l3J 

31 

3Q 

31 

3Q 

”331 

^330 

Group  1 

Group  2 

Time  Steps  1-8 


Time  Steps  9-16 


An  interesting  observation  can  be  made:  namely,  that  the  execution 
time  is  the  same  with  one  less  processor.  It  can  also  be  shown  that 
the  evaluation  time  will  also  be  the  same  with  5 processors. 

The  total  number  of  complex  terms  is  N(N+l)/2  for  the  triangle  form  of 
2 

the  matrix  and  N for  the  full  matrix.  If  we  have  M processors,  then  the 
number  of  groups,  G,  will  be 


G *=  triangular  matrix 

r ^ 

G ■»  I jij-  for  the  complete  matrix 
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where  fx"!  is  smallest  integer  greater  than  or  equal  to  x,  i.e.,  the 
ceiling  function. 

These  equations  can  be  used  in  the  following  ways: 

Given  the  maximum  time,  T,  allowed  to  compute  the  covariance  matrix 
and  the  dimension,  N,  what  is  the  minimum  number  of  processors  required? 

We  will  now  relate  number  of  processors,  M,  and  number  of  weights, 
N,  to  determine  number  of  groups,  G,  needed  to  perform  the  covariance 
update.  The  running  time  is  a constant,  C,  times  the  number  of  groups. 

The  time  per  group  is  t,  then  the  number  of  groups  required  is 
G = fT/tl  . We  then  solve 

G = ® 

H.fMlUl 

Because  of  the  ceiling  function  the  total  time  may  be  less  than  T. 

If  a machine  such  as  STARAN,  PEPE  or  ILLIAC  IV  is  available,  then 
M will  be  chosen  as  the  maximum  available. 

If  a machine  is  to  be  designed  then  the  above  computation  will 
be  performed. 

Because  for  a given  G and  N,  more  than  one  M may  satisfy  the 

equations,  questions  of  designing  for  reliability  cotr } up. 

For  example,  let  N = 200,  G = 201 
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Then  M can  range  from  76  to  TOO.  If  the  system  1s  constructed  with 
76  processors  and  one  breaks,  then  G will  increase  to  268  with  a corres- 
ponding increase  in  running  time.  If,  however,  100  processors  were  con- 
structed, then  up  to  24  processors  could  break  without  degrading  system 
performance.  Neither  extreme  is  probably  practical.  By  calculating  the 
MTBF,  MTTR  and  MAXTTR  of  processors,  one  can  determine  the  number  of 
processors  above  the  minimum  number  of  processors  which  should  be  specified. 

Another  question  of  reliability  is.  What  occurs  if  N becomes  smaller, 
such  as  if  a receiver  breaks?  By  looking  at  the  equation  G ] > 

it  is  obvious  that  for  N^  and  N2  such  that  < N2  and 


N, (Nn+l )- 


and  G. 


i2(N2+l)1 

2M  I 


Then  G-j  _<  G2.  This  means  that  for  M fixed,  there  will  be  no  increase 
in  running  time  as  N decreases. 

Comparing  the  global  and  packed  methods  for  a triangular  matrix,  we 
have  the  following: 


N = 200,  M = 256  (1  STARAN  array) 


Global 


Packed 


'“'Ir 


G = (P+1)  (MP/2  + (M=P„)) 


P = 0 


G = 200 


This  implies  that  the  packed  form  is  worthy  of  study.  Currently  the 
algorithms  are  not  known  for  implementing  it  on  the  STARAN  and  PEPE.  Also 
in  the  example  given  N<M  so  we  are  not  utilizing  N-M  = 56  processors. 


Appendix  K.  Parallel  Direct  Implementations  to  Solve  MW  = 5" 

2 

with  0{N  ) Processors 

2 

These  implementations  were  developed  by  noticing  that  M has  N elements, 
and  that  it  would  be  possible  to  assign  one  processor  to  each  element.  If 
only  the  upper  or  lower  triangular  matrix  is  stored,  then  N{N+l)/2  elements 
are  required.  By  assigning  one  element  to  each  processor,  we  hope  to  obtain 
a speed  increase  over  sequential  or  parallel  techniques  with  0(N)  processors. 

We  will  look  in  detail  at  two  algorithms:  Gauss-Jordan  and  Cholesky. 

The  reason  for  selecting  Gauss-Jordan  is  its  inherent  parallelism  of  select- 
ing a two  and  then  eliminating  on  all  other  rows.  Cholesky  was  chosen  be- 
cause it  requires  only  half  of  the  matrix,  and  we  feel  the  difference  between 
N processors  and  N{N+l)/2  processors  is  sufficiently  large  for  large  N and 

that  this  algorithm  is  worth  exploring.  For  other  implementations  see  Sameh 
* 

and  Kuck  . 

Gauss- Jordon  Implementation 

We  assign  to  each  processor  an  element  of  processor,  p contains  element 

2 

M.  such  the  p = (i-l)N  + j;  1 < i,  j < N.  This  requires  N processors.  We 

j 

use  another  N processor  to  hold  the  elements  of  the  steering  vector. 

We  will  illustrate  the  technique  on  a 3 x 3 matrix.  Figure  K-1. 

Sameh,  A.  H.,  and  D.  J.  Kuck,  Linear  System  Solvers  for  Parallel  Computers. 
Department  of  Computer  Science.  Report  No.  UILICDCS-R-75-701  , University  of 
Illinois  at  Urbana-Champaign,  February  1975. 
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We  have  now  eliminated  on  the  first  row  in  three  steps.  Since  there  are  H 
rows.  Gauss- Jordan  will  take  3N  steps.  All  of  these  steps  are  straightfor- 
ward except  the  expand  (outer  product)  step.  This  operation  requires  move- 
ment of  data  between  processors,  if  only  a global  register  is  available,  the 
data  movement  would  require  moving  2N-2  data  items  and  3N-3  processor  enables 
for  each  row.  Since  there  are  N rows,  data  movement  will  be  an  0(N  ) process 
and  the  direct  methods  using  0(N)  processors  are  comparable.  What  is  re- 
quired is  a special  interconnect  structure  which  is  able  to  perform  the  other 
product  in  time  independent  of  N,  preferably  in  one  or  two  steps.  This  inter- 
connect structure  is  possible  and  is  in  fact  similar  to  that  of  the  ILLIAC  IV. 
Again,  by  using  a 3 x 3 matrix,  the  interconnect  strength  looks  like: 

Col.  1 Col.  2 Col.  3 

Row  1 

Row  2 

Row  3 

To  propagate  the  values  as  in  the  previous  example,  we  move  all  data  verti 
cally  in  one  step  and  all  data  horizontally  in  one  step.  This  can  also  be 
accomplished  if  there  is  a global  register  for  each  row  and  column. 
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If  the  ILLIAC  IV  structure  is  used,  then  N-1  steps  would  be  required  for 


vertical  movement  and  N-1  steps  would  be  required  for  horizontal  movement. 

Although  this  discussion  has  dealt  with  performing  G-J,  as  shown  earlier, 
by  simply  adjoining  the  steering  vector  to  the  matrix  M,  we  will  obtain  the 
weights  at  the  same  time. 

Cholesky  Implementation 

In  this  system,  we  have  N(N+1)/2  parallel  processors.  Again  we  will 
demonstrate  the  technique  on  a 3 x 3 matrix,  Figure  K-2. 
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As  in  the  previous  section  on  G-J,  the  outer  product  requires  an  interconnect 
structure  to  use  this  system  to  advantage. 

This  process  leaves  the  L decomposition  of  M in  the  N(N+l)/2  processors. 
We  must  still  solve  for  the  weights.  This  can  be  done  by  two  back  substitu- 
tions or  adjoining  T to  the  original  matrix  and  performing  one  ‘^ack  substitu- 
tion as  shown  in  Part  4.  A back  substitution  can  be  performed  in  N processors 
as  outlined  in  the  discussion  on  PEPE.  We  can  use  either  N processors  or  the 
original  N(N+l)/2  processors.  If  another  set  of  processors  is  used,  however, 
we  can  pipeline  the  processors  to  work  on  more  than  one  covariance  matrix  at 
a time.  (See  Figure  K-3.) 
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Figure  K-3  Pipeline  Decomposition  and  tack  Substitution 
where  t^  = 0(N) 
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Appendix  L.  Implementations  of  Algorithms  to  Solve  for 
Adaptive  Weights  on  the  PEPE 

All  the  implementations  in  this  section  are  written  in  an  easily 
understood,  structured  progranming  language.  These  implementations  are 
presented  for  operation  counts  purposes  only  and  are  not  intended  to  be 
compilable  code.  For  a discussion  of  these  implementations,  see  Section 
4. 4. 2. 2,  "Implementations  of  Algorithms  to  Solve  for  Adaptive  Weights  on 
the  PEPE." 

The  variables  used  in  these  implementations  may  be  divided  into  two 
classes,  depending  on  whether  they  are  stored  in  sequential  memory  (S)  or 
parallel  memory  (P).  Parallel  variables  are  distinguished  from  sequential 
ones  by  a "*"  as  a subscript.  For  example,  the  parallel  variable  R(*) 
refers  to  corresponding  memory  locations  in  each  PE.  The  parallel  array 
X(N,*)  is  stored  in  N memory  locations  in  each  PE.  When  a parallel 
assignment  statement  is  executed,  such  as 

X(l,*)  = R(*)*X{2,*)  , 

the  multiplication  is  performed  in  each  currently  enabled  processor  simul- 
taneously. 

Mixed  sequential-parallel  assignment  statements  are  allowed.  If  a 
parallel  variable  is  on  the  left  side  of  the  equal  sign, 

R(*)  = mixed  expression  (arithmetic  expression  co  taining  both 
sequential  and  parallel  variables), 

any  number  of  PE's  may  be  active,  and  each  distinct  sequential  variable 
in  the  mixed  expression  must  be  moved  from  sequential  to  parallel  memory 
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via  an  S-P  operation.  If  a sequential  variable  is  on  the  left  side  of 
the  equal  sign, 

H(K)  = mixed  expression, 

exactly  one  PE  must  be  enabled,  and  the  value  calculated  by  the  mixed  ex- 
pression in  that  PE  will  be  transferred  to  sequential  memory  by  a P-S  oper- 
ation. 

The  statements  "ENABLE  I" or  "ENABLE  I TO  J"  or  "ENABLE  ,l2, . . • ,1^" 
enable  the  indicated  PE's  and  turn  off  all  the  others.  The  enabled  PE's 
remain  enabled  until  the  next  "ENABLE"  statement  is  encountered. 

Variables  will  either  be  of  type  integer  (I),  floating-point-real  (R), 
or  floating-point-complex  (C).  Complex  variables  are  stored  with  their 
real  and  complex  parts  in  separate  locations,  as  described  in  the  subsection 
of  4. 4. 2. 2 entitled  "Introduction,"  and  are  used  to  simplify  notation.  If 
it  is  necessary  to  refer  to  the  real  or  imaginary  parts  of  a complex  vari- 
able separately,  the  notations  REAL(‘)  and  IMA6(‘)  will  be  used.  (For  example, 
REAL(M(J,*))  refers  to  the  real  part  of  the  row  of  M in  all  active 
parallel  elements.)  Similarly,  CONJG(')  means  the  complex  conjugate  of  the 
complex  variable  in  parentheses. 

Sequential  variables  used  include  N(type  I,  the  dimension  of  the  prob- 
lem), K (type  I,  the  number  of  steering  vectors),  NS  (type  I,  the  number  of 
sample  /ectors),  RECIP(N)  (type  R,  the  reciprocals  of  the  diagonal  elements 
of  the  factor  matrix),  and  X(N)  (type  C,  the  sample  vector).  Parallel  vari- 
ables include  the  sample  covariance  matrix  M(N,*)(type  C),  the  conjugated 
steering  vectors  S(K,*)  or  S(N,*)(type  C),  and  an  array  of  the  reciprocals 
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of  the  diagonal  elements  of  the  factor  matrix,  R(*)  (type  R--the  i^*^  PE 
contains  the  reciprocal  of  the  i^^  diagonal  element).  Sample  covariance 
matrix  will  be  abbreviated  to  SCM. 

Storage  schemes  are  rowwise  or  columnwise  for  the  (factored)  SCM,  and 
componentwise  or  vectorwise  for  the  steering  vectors,  as  discussed  in  the 
introduction  to  Section  4.4. 2. 2.  The  abbreviations  for  the  storage  schemes 
used  in  the  back  substitutions  are  given  in  Fig.  4.39c.  For  the  first  back 
substitutions,  and  second  back  substitutions  with  vectorwise  steering  vector 
storage,  it  is  assumed  the  steering  vectors  are  stored  in  the  first  K (com- 
ponentwise storage)  or  N (vectorwise  storage)  PE's.  The  second  back  substi- 
tutions with  componentwise  steering  vector  storage  may  either  follow  an 
augmented  or  an  unaugmented  decomposition.  In  the  first  case,  the  steering 
vectors  are  stored  next  to  M in  PE's  N+1  through  N+K  (see  Fig.  4.39e),  and 
in  the  second  case  they  are  stored  below  M in  PE's  1 through  K.  This  dif- 
ference does  not  affect  the  operation  counts  of  the  implementation,  just 
the  means  of  accessing  the  data.  To  represent  both  schemes,  we  have  chosen 
the  following  notation:  in  any  line  of  code  with  the  second  part  of  the 
line  set  off  in  square  brackets  ([]),  the  code  outside  the  brackets  will 
refer  to  the  augmented  case,  and  the  code  inside  brackets  will  refer  to 
the  unaugmented  case.  Similarly,  bracketed  variables  in  the  variable  list 
preceding  the  code  itself  are  used  in  the  unaugmented  case  only. 

The  first  back  substitutions  solve  LOT  = S when  M = LDL*  (LT  = S when 
M = LL*)  and  the  second  back  substitutions  solve  L*W=T  when  M=LDL*  (L*W=T 
when  M=LL*). 
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Before  each  implementation  is  the  number  of  PE's  required  and  a list 
of  all  variables  used,  with  their  lengths  (length  per  PE  for  parallel  vari- 
ables), locations  (SorC),  types  (I,R,  or  C),and  descriptions  of  their  con- 
tents (omitted  for  M,  S,  RECIP,  X,  R). 

FOR-END  FOR  loops  behave  as  do  those  in  Appendices  A and  B. 


Method  1 for  Updating  the  Sample  Covariance  Matrix 
Number  of  PE's  required  = N 


Variable 

Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39a 

X(N) 

2N 

S 

C 

Sample  vector 

XT(*) 

2 

P 

C 

transpose  of  sample  vector 

JROW 

1 

S 

I 

current  row  of  SCM  being 
calculated 
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a a 


BEGIN 

HEPEAT  THIS  CODE  FOR  AT,L  NS  SAMPLE  VECTORS 
MOVE  TPJVtJSPOSE  or  SAMPLE  VECTOR  TO  PE'S 

FOR  JROW  - 1 TO  N 
ENABLE  I 

>:T(  • ) - CONG  J(  X(  JROW) ) 

END  FOR 

C UPDATE  EACH  ROW 

FOR  JROW  - 1 TO  H 

ENABLE  JROW  TO  N 

M(JROW,*)  - M(JROW,*)  ♦ X(JROW)  • XT(*) 
END  FOR 
END 
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Method  2 for  Updating  the  Sample  Covariance  Matrix 
Number  of  PE’s  required  = N 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39a 

X(N) 

2N 

S 

C 

sample  vector 

XT(*) 

2 

P 

C 

transpose  of  sample  vector 

JROM 

1 

S 

I 

current  row  of  SCM 
being  calculated 
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o o 


BEGIN 


KEPCAT  THIS  CODE  FOR  ALL  HS  SAllPLE  VECTORS 
MOVE  TRANSPOSE  OP  SAMPLE  VECTOR  TO  PE'S 

FOR  JROW  - 1 TO  N 
ENABLE  JROV7 

. XT( • ) - CONG J(  X(  JROW) ) 

END  FOR 

C UPDATE  EACH  ROW 

ENABLE  1 TO  N 
DO  JROW  - 1 TO  N 

. M<JROW,  •)  - M(JROW,  *)  ♦ X{  JROW)*XT(  •) 

END  DO 

END 
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Method  3 for  Updating  the  Sample  Covariance  Matrix 
Number  of  PE's  required  = N’(N+1  )/2 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N,*) 

2N 

P 

C 

initially  upper  triangle  of 

M stored  rowwise  in  N*(N+l)/2 
processors,  finally  as  seen 
in  Fig.  4.39a  but  upper  half 
only 

X(M) 

2N 

S 

C 

sample  vector 

X2(*) 

2 

P 

C 

to  conjugated  compo- 

nents of  sample  vector 
followed  by  2f‘^  to  conju- 

gated components  of  sample 
vector,  3'’‘1  to  Nth,  etc.,  in 
all  N-(N+l)/2  PE's 

xi(*) 

2 

P 

C 

s t 

1 component  of  sample  vector 

N times,  followed  by  2nd  com- 
ponent of  sample  vector  N-1 
times,  jnd  component  N-2 
times,  etc.,  in  all  N*(N+l)/2 
PE's 

MH(- ) 

N-(N-l) 

S 

C 

used  to  store  2'^'^  through 
rows  of  outer  product  of 
sample  vector 

ISTR 

1 

S 

I 

first  PE  to  enable 

ISTP 

1 

S 

I 

last  PE  to  enable 

I SUB 

1 

S 

I 

location  in  MM(M)  of  current 
element  of  SCM  being  moved  to 
M(MM) 

JROW 

1 

s 

I 

row  of  current  element  of  SCM 

JCOL 

1 

s 

I 

column  of  current  element  of  SCM 

o o 


BEGIN 

C REPEAT  THIS  CODE  TOR  ALL  «S  SAMPLE  VECTORS 

ISTR  - 1 

ISTP  “ 1 

C MOVE  SAMPLE  VECTOR  INTO  PE'S 

FOR  jnOW  - 1 TO  N 
. EHADLE  ISTR  TO  ISTP 
Xl(*)  - X(JROW) 

ISTR  - ISTP  ♦ I 

ISTP  - ISTP  ♦ N - JROW 

ENABLE  J,J+N-l,J+2*N-3 ,J+3*N-6 , . . . ,J+K»N-K*< K+l)/2, 

. . . ,< J-1)»N-J»(  J-3)/2 

X2( *)  - X(  JROW) 

END  FOR 

ENABLE  1 TO  N*(N+l)/2 

♦ X1(«)*X2(*) 


EXECUTE  THIS  CODE  AFTER  ALL  NS  SAMPLE  VECTORS  DONE 

FOR  ISUB  - N+1  TO  N’»{N+l>/2 
. ENABLE  ISUB 
. MM(  ISUB-N)  - M< 1,*) 

END  FOR 

FOR  JROVJ  - 2 TO  N 
ENABLE  JROW 

. ISUB  - N - 1 ♦ JROW 
. FOR  JCOL  - 2 TO  JROW 
. . MCJCOL,*)  - MMdSUBJ 

ISUB  “ ISUB  ♦ N - JCOL 
. END  FOR 
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Gtiuss-Jordan  (qj) 

Number  of  PE's  required  = n+K 


Variable  Name 

Length 

Location 

Type 

Description 

M(N.*) 

2N 

P 

C 

augmented  (see  Fig.  4.39e) 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP(N) 

2N 

S 

C 

elements  of  JCOLth 
column  of  M 

JROW 

1 

s 

I 

current  row  being  eliminatec 

JCOL 

1 

s 

I 

current  column  being  zeroed 
out 

BEGIN 


C ELIMINATE  EACH  COLUMN 

FOR  JCOL  - 1 TO  H 
ENABLE  JCOL 

m*)  - l./REAL(M(  JCOL,*)) 
nECIP(JCOL)  - R(*) 

. FOR  JROW  - 1 TO  N EXCEPT  JCOL 
TEMPI  JEOW)  - Ml  JROW,*) 

END  FOR 

ENABLE  1 TO  N+K 

C * NORMALIZE  THE  JCOL-TH  ROW 

Ml JCOL,*)  - RECIPlJCOL)  * Ml  JCOL,*) 

C * ELIMINATE  THE  JCOL-TH  COLUMN 

FOR  JROW  - 1 TO  H EXCEPT  JCOL 

Ml  JROW,*)  - Ml  JROW,*)  - TEMPI  JROW)  • Ml JCOL,*) 
END  FOR 
END  FOR 
END 
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GE--unaugmented 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39a 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP(N) 

2N 

S 

C 

contains  JCOLth  column 
of  SCM 

JROW 

1 

S 

I 

current  row  being  eliminate' 

JCOL 

1 

S 

I 

current  column  being  zeroed 
out 
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BESIH 


C ELIMI^!^TE  EACH  COLUMN 

FOB  JCOL  - 1 TO  N-1 
ENABLE  JCOL 

R(  • ) - 1 . /BEALf  M(  JCOL , * ) > 

RECIP(JCOL)  - R(») 

FOB  JnOW  - JCOL+1  TO  H 

TEtlP<  JP.OV;)  - MtJROW,*) 

END  FOR 

ENABLE  JCOL  TO  N 

C • NORMALIZE  JCOL-TH  ROW 

M(JCOL,»)  - HECIP(JCOL)  • M(JCOL,*) 

C * ELIMINATE  JCOL-TH  COLUMN 

FOR  JROW  - JCOL+1  TO  N 

MCJROW,*)  - M(JnOW,  *)  - TEMP(JROW) 
END  FOR 
END  FOR 

C CALCULATE  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N 

F(«)  - l./REALCM(N,*>) 

RECIP(  N)  - R(  •) 

END 
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, JL 


M( JCOL,*) 


LDL*”Unaugmented— optimi  zed 


This  implementation  is  identical  to  GE—unaugmented. 


LDL*--unauyinented--unoptimizecl 
Number  of  PE's  required  = N 

Variable  Name  Length  Location  Type  Description 


M(N,*) 

2N 

P 

c 

see  Fig.  4.39a 

R(*) 

1 

P 

R 

reciprocal  diagonal 

T(*) 

2 

P 

C 

temporary  containing  un- 
normalized JROWth  row  of 
SCM 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP 

2 

S 

C 

temporary  containing  the 
(JROW,JSUB)th  element 
of  SCM 

JROW 

1 

s 

I 

current  row  of  SCM  being 
calculated 

JSUB 

1 

s 

I 

row  from  which  multiple  ol 
JROWth  row  is  subtracted 

L-16 


BEGIN 


SUBTRACT  MULTIPLES  OF  EACH  ROW  FROM  FOLLOWING  ROWS 

FOR  jr.ov:  » 1 TO  h-i 

EIIABLE  JROM 

R(*)  - 1 . /REAL(  i!(  anov/,*) ) 

RECIft  JROV/)  - R(*) 

ENABLE  Ji;OW+I  TO  N 

* KOHALI3E  JROVJ-TH  ROW 
T(  *)  - M(  JROW,*) 

IKJROW,*)  - RECIP(JnOW)  * IKJROV?,*) 

* SUBTRACT  MULTIPLES  OF  JEOW-TH  ROW  FROM  FOLLOWING  ROWS 

. FOR  JSUB  - JROW+l  TO  N 
ENABLE  JSUB 
TEMP  “ T(*) 

ENABLE  JSUB  TO  N 

IKJSUB,*)  " MCJSUB,*)  - TEMP  • MCJROV/,*) 

END  FOR 
END  FOR 
ENABLE  N 

R(»)  - l./REAL(M(N,*)) 

RECIP(N)  - R(*) 

END 
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LL*--unaugmented--optimized 


Number  of  PE's  required  = N 

Variable  Name  Length  Location  Type  Description 


M(N ,*) 
R(*) 

RECIP(N) 

TEMP(N) 

SQR(*) 

SQ(N) 

JROW 

JCOL 


2ri 

1 

N 

2N 

1 

N 

1 

1 


P 

P 

S 

s 

p 

s 

s 

s 


C see  Fig.  4.39a 

R reciprocal  diagonals 

R reciprocal  diagonals 

C contains  JCOLth  column 

of  SCM 

R square  roots  of  diagonal 

elements 

R square  roots  of  diagonal 

elements 

I current  row  being  eliminated 

I current  column  being  zeroed 

out 
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no  n no  non 


BEGItJ 


Tins  IMPEEMEMTATION  IS  IDENTICAL  TO  GE-UN.VJGMENTED 
EXCEPT  POP.  SOIIE  ADDITIONAL  CODE  AT  END 
ELIMINATE  EACH  COLUMN 

FOR  JCOL  - 1 TO  K-1 
EIIADLE  JCOL 

R(*)  - 1 . /REALC  H(  JCOL,  *)  ) 

RECIP(  JCOL)  - R(  • ) 

FOR  JDOV)  - JCOL  + 1 TO  N 

TEMP(JROW)  - M(JROW,*) 

END  FOR 

ENABLE  JCOL  TO  N 

• NORMALIZE  JCOL-TH  ROW 

MCJCOL,*)  - RECIP(JCOL)  • M(  JCOL , * ) 

• ELIMINATE  JCOL-TH  COLUMN 
FOR  JROW  - JCOL+1  TO  N 

M(JROW,»)  - MCJROW,*)  - TEMP(JROW)  * MCJCOL,*) 
END  FOR 
END  FOR 

CALCULATE  LAST  RECIPROCAL  DIAGONAL 
ENABLE  N 

R(  *)  - 1. /REALC  MCN,*)) 

RECIPC  N)  - RC  *) 

CALCULATE  SQUARE  ROOTS  OF  DIAGONAL  ELEMENTS 
AND  EACH  ROW  OF  MATRIX  BY  CORRESPONDING  VALUE 

ENABLE  1 TO  N 

SQRC  »)  - 1 . /SQRTC  RC  » ) ) 

FOR  JROW  - 1 TO  N 
ENABLE  J 
SQCJ)  - SQRC*) 

END  FOR 
ENABLE  1 TO  N 
FOR  JROW  - 1 TO  N 

MCJROW,*)  - SQCJROV;:  * MCJROV),*) 

END  FOR 
END 
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LL*--unaugmented--unoptimi zed 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39a 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP 

2 

S 

C 

temporary  containing 
(JROW,JSUB)th  element 
of  SCM 

JROW 

1 

S 

I 

current  row  of  SCM  being 
calculated 

JSUB 

1 

S 

I 

row  from  which  multiple  ( 
JROWth  row  is  subtracted 

L-2n 


BEGIN 


C SUBTRACT  MULTIPLE  OF  EACH  ROW  FROM  FOLLOV/ING  ROWS 

FOR  JROW  - 1 TO  n-1 
ENABLE  JROV7 

n(»)  - 1 . /SORTC  REALC  M(  JROW,«  ) ) ) 

RECIP(JROW)  - R(*) 

ENABLE  JROW  TO  N 

C * CALCULATE  JROW-TH  ROVI 

M(JROW,*)  - RECIP(JROW)  » MtJROW,*) 

C * SUBTRACT  MULTIPLES  OF  JROW-TH  ROW  FROM  FOLLOWING  ROWS 

FOR  JSUB  “ JROW+1  TO  N 
ENABLE  JSUB 
TEMP  - M(JROVJ,*) 

ENABLE  JSUB  TO  N 

M(JSUB,*)  “ M(JSUB,*)  - TEMP  • M(JROW,«) 

END  FOR 
END  FOR 
ENABLE  N 

RC")  - l./SQRT(  REAL(  M(  N,»)  ) ) 

RECIP<N)  - R(») 

END 
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GE--augmented 


P 


Number  of  PE's  required  = N+K 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N.*) 

2N 

P 

C 

see  Fig,  4.39e 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP{M) 

2N 

S 

C 

contains  JCOLth  column 
of  SCM 

JROW 

1 

s 

r 

current  row  being  eliminated 

JCOL 

1 

s 

T 

current  column  being  zeroec 
our. 

n n n o n n 


BEGIH 


T!iIS  IMPLEr’.EHTATIOtl  IS  ALV.OST  IDENTICAL  TO 
UEAUGI'.EIJTEO  GE  EXCEPT  ’ EUaDLE  JCOE  TO  11’ 

EECOIIES  ’ENABLE  JCOL  TO  H+iC’  AND  THE  LAST 
ROV;  IS  MULTIPLIED  BY  THE  LAST  RECIPROCAL  DIAGONAL 

ELIMINATE  EACH  COLUMN 

FOR  JCOL  - 1 TO  N-1 
ENABLE  JCOL 

R<*)  - l./REAL(M(  JCOL,*)) 

RECIP(JCOL)  " n(*) 

FOR  JROW  - JCOL+1  TO  N 

TEMPI  JROW)  - M(JROW,») 

END  FOR 

ENABLE  JCOL  TO  N+K 
C * NORMALIZE  JCOL-TH  ROW 

M(JCOL,*)  • RECIP(JCOL)  • MCJCOL,*) 

C * ELIMINATE  JCOL-TII  COLUMN 

FOR  JROW  " JCOL+1  TO  N 

MCJROW,*)  - MCJROW,*)  - TEMPI  JROW)  * M(JCOL,*) 
END  FOR 
END  FOR 

C CALCULATE  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N 

R(*)  - l./REAL(M(N,*)) 

RECIP(  N)  - R(  •) 

C MULTIPLY  LAST  ROVl  BY  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N+1  TO  N+K 
M(N,*)  - nECIP(N)  * M(N,*) 

END 
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LDL*--augmented--optimi zed 


This  implementation  is  identical  to  GE— augmented. 
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LDL*--augmented--unoptimi zed 
Number  of  PE's  required  = N+K 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39e 

R(*) 

1 

P 

R 

reciprocal  diagonal 

T(*) 

2 

P 

C 

temporary  containing  un- 
normalized JROWth  row  of 
SCM 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP 

2 

S 

C 

temporary  containing  the 
(JROW,JSUB)th  elements 
of  SCM 

JROW 

1 

S 

I 

current  row  of  SCM  being 
calculated 

JSUB 

1 

S 

I 

row  from  which  multiple 
of  JROWth  row  is  sub- 
tracted 
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on  o ooooono 


c:;gin 

Tins  imi.rrinriTi'.Tic:’  rs  amiost  Tpnr-T-rzvL  to 
LDA" -urrATCMKiiTr.D-uiiaPTrMTnm , i::-cr?T 

"niTiBAn  >;  to  ii"  Knee-.'::;!)  "ckadak  >:  to  ako 

Tii;:  LAST  ROW  IS  MSLTPLinD  EK  THE  LAST  RECIPROCAL 
DIAGOPAL 

SUBTRACT  MULTIPLES  OF  EACH  ROW  FROM  FOLLO'.;i:JG  ROWS 

FOR  JROW  - i TO  N-1 
ERADLE  JROW 

r(»)  - 1 . /REAL!  Jl(  JROVV,  *)  ) 

RECIP(JP.OW)  “ R(*) 

EUAELE  JROW+1  TO  N+K 

• NORMALIZE  JROW-TII  ROW 
T(*)  - M(JROW,*) 

M(jnOW,»)  - RECIP(JROW)  • MIJROW,'’) 

• SUBTRACT  MULTIPLES  OF  JROW-TH  ROVI  FROM 
» FOLLOWING  ROWS 

FOR  JSUn  JROV7+1  TO  N 
ENABLE  JSUB 
TEMP  = X(  * ) 

ENABLE  JSUB  TO  N+K 

Mi JSUB,*)  “ MiJSUB,*)  - TEMP  * MiJROW,*) 

END  FOR 
END  FOR 
ENABLE  H 

R(*)  " 1 . /REAL!  M(  N,*)  ) 

P.ECIP(N)  ” R(*) 

C MULTIPLY  THE  LAST  ROW  BY  THE  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N+1  TO  N+K 

Mill,*)  “ M(N,*)  * RECIP(N) 

END 


LL*--augmented--optimi zed 
Number  of  PE's  required  = N+K 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39e 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP{N) 

N 

S 

R 

reciprocal  diagonals 

TEMP(N) 

2N 

S 

C 

contains  JCOLth  column 
of  SCM 

SQR(*) 

1 

P 

R 

square  roots  of  diagonal 
elements 

SQ(N) 

N 

S 

R 

square  roots  of  diagonal 
elements 

JROW 

1 

S 

I 

current  row  being  eliminatec 

JCOL 

1 

S 

I 

current  column  being  zeroed 
out 

L-27 


oo  n n a o onoonti 


BEGIN 


■ 


C THIS  IMPEEt!EnTATION  IS  AWICST  IDENTICAL  TO 

LL’-UNAUGriCKTED-OPTIMlHED  EXCEPT 
"EUACLE  uCOI,  TO  N"  BECC!;ES  "EI'ABLE  .7COL  TO  N+K" 

AND  MULTIPLYING  THE  LAST  ROW  BY  THE  LAST 
RECIPROCAL  DIAGONAL 

ELIMINATE  EACH  COLUIUJ 

FOR  JCOL  - 1 TO  N-1 
ENABLE  JCOL 

R(  • ) - 1 . /REAL(  IK  JCOL , • ) ) 

RECIP(JCOL)  - R(*) 

FOR  JROW  - JCOL+1  TO  N 

TEMPI  JROW)  - MIJROW,*) 

END  FOR 

ENABLE  JCOL  TO  N+K 

• NORI-IALIZE  JCOL-TH  ROW 
MIJCOL,*)  “ RECIP(JCOL)  • M(JCOL,») 

• ELIMINATE  JCOL-TH  COLUMN 
FOR  JROW  - JCOL+1  TO  N 

MIJROW,*)  “ MIJROW,*)  - TEMP(JROW)  • MCJCOL,*) 
END  FOR 
END  FOR 

CALCULATE  LAST  RECIPROCAL  DIAGONAL 
ENABLE  N 

R(*)  - 1 . /REAL!  Ml  N,*)  ) 

RECIPC  N)  - n(  •) 

MULTIPLY  THE  LAST  ROVI  BY  THE  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N+1  TO  N+K 
M(N,*)  - M(N,*)  • RECIP(N) 

CALCULATE  SOUARB  ROOTS  OF  DIAGONAL  ELEMENTS 
AND  MULTIPLY  EACH  ROV7  OF  MATRIX  BY  CORRESPONDING  VALUE 

ENABLE  1 TO  N 
SQR(  *)  - 1 ./SORT!  R{  *)  ) 

FOR  JROW  “ 1 TO  N 
ENABLE  J 
SQ{  J)  - SQR(  *) 

END  FOR 

ETTAELE  1 TO  N+IC 
FOR  JROW  » 1 TO  N 

IKJROW,*)  - SQ(JROW)  • M(JROW,»> 

END  FOR 
END 


L-28 


LL*--  augmented""  unoptimized 
Number  of  PE's  required  = N+K 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39e 

R(*) 

1 

P 

R 

reciprocal  diagonals 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

TEMP 

2 

S 

C 

temporary  containing 
(JROW,JSUB)th  element 
of  SCM 

JROW 

1 

S 

I 

current  row  of  SCM  being 
calculated 

JSUB 

1 

s 

I 

row  from  which  multiple  oi 
JROWth  row  is  subtracted 
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o n n n o n o 


BEGIN 


THIS  IH^EEMEHTATION  IS  AEflOST  IBENTICAL  TO 

le*-ih’aug!:ei:ted-ukoptiiiiei;d  except 

"ErAEEE  X TO  N"  BECOIIES  "EIXAE’.E  X TO  N+K" 

AED  IIUETIPLYIKG  THE  LAST  P.OV7  EY  THE  LAST 
nECIPP.OCAL  DIAGONAL 

SUBTRACT  MULTIPLES  OF  EACH  ROW  FROM  FOLLOWING  ROWS 

FOR  JROVJ  - I TO  N-1 
ENABLE  JROW 

R(*)  - l./SQRT(  REAL(M(  JROW,*))) 

RECIP(JROW)  - R(*) 

ENABLE  JROV7  TO  N»K 

C * CALCULATE  JROW-TH  ROW 

M{JROW,*)  - HECIP(JROW)  * I JnOV7,*) 

C * SUBTRACT  MULTIPLES  OF  JROW-TH  ROW  FROM  POLLOWING  ROWS 

FOR  JSUB  - JROW+1  TO  N 
ENABLE  JSUB 
TEMP  - M(JROW,*) 

ENABLE  JSUB  TO  N+K 

. . MCJSUB,*)  - M(JSUB,*)  - TEMP  * M(JROW,») 

END  FOR 
END  FOR 
ENABLE  N 

R(*)  - l./SQRT(  REAL(  M(  N,*)  ) ) 

RECIP(  N)  - R(  *) 

C MULTIPLY  THE  LAST  ROW  BY  THE  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N+1  TO  N+K 

M(N,*)  - RECIP(N)  * M(N,*) 

END 
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First  Back  Substi tution— LDL*-GE-- , „ 


Number  of  PE' 

's  required 

= N 

Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39b  (rowwise) 

S(K,*) 

2K 

P 

C 

see  Fig.  4.39c  (vectorwise) 

TEMP(K) 

2K 

S 

C 

temporary  containing  JROW-1^^ 
components  of  S 

R(*) 

1 

P 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  S 
being  calculated 

AUG 

1 

S 

I 

current  steering  vector 
being  calculated 

L-31 


’ nnox’T 


c sucTr..'.c7  r.ul.TiPLCG  CF  ni  '.'G  OF  n Fr.on  c 
Fon  Jro-;  - 2 ?o  n 
I 1 

. '!’(  ' = ' -'I  ■ * ) 

l.NT  : 

EF.'.Dl.C  Zr.OVi  TO  13 

c * c.'.LC’Ji,.TTE  jnov/-’;  ' c .'••onr'iTn  or  s 
FOR  AUG  » 1 TO  1. 

S(AUG,*)  - F'i'.UG,*)  - TEt’rtJ)  * COF  JG(  IK  JF.OV7- 1 , * ) ) 
END  FOR 
END  FOR 

C MULTIPLY  BY  RECIPROCI.L  DIAGONALS 

ENABLE  1 TO  N 
FOR  AUG  - I TO  K 

S(AUG,*)  " S(AUG,*)  • R(*) 

END  FOR 
END 


L-3.' 


First  Back  Substitution  LDL*-GE~'^  dZI 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39b  (rowwise) 

S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise) 

TEMP(N) 

2N 

S 

C 

temporary  containing  JROWth 
column  of  L* 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  T 
being  calculated 

JSUB 

1 

S 

I 

component  a multiple  of  which 
is  being  added  to  JROWth 
component 

BEGIN 


C SUBTRACT  MULTIPLES  OF  EACH  ROW  FROM  FOLLOWING  ROWS 

FOR  JliOW  - 2 TO  H 
ENABLE  JROV7 
FOR  JSUB  - 1 TO  JROW-1 

TEMPCJSUB)  - MCJSUB,*) 

END  FOR 
ENABLE  1 TO  K 

C * CALCULATE  JROW-TH  COMPONENT  OF  S 

FOR  JSUB  - 1 TO  JROW-1 

S(JROW,*)  - S(JROW.»)  - CON JG( TEMP(  JSUB) ) • S(JSUB,») 
END  FOR 
END  FOR 

C MULTIPLY  BY  RECIPROCAL  DIAGONALS 

FOR  JROW-  1 TO  N 

S(JROW,*)  - S(JROW,»)  • RECIPlJROW) 

END  FOR 
END 


First  Back  Substi tution-- LDL*-GE-- 


This  implementation  first  transposes  M and  then  uses  First 
Back  Substi tution--LDL*-GE-- . 

[H] 
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First  Back  Substitution— LDL*-GE-- 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39b  (columnwise) 

S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise) 

TEMP(N) 

2N 

S 

c 

temporary  containing  JROWth 
row  of  L* 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  T 

JSUB 

1 

S 

I 

component  from  which  a multiple 
of  the  JROWth  component  is 
substi tuted 
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BEGIN 


C SUBTHACT  MULTIPLES  OP  EACH  ROW  FROM  FOLLOV/ING  ROWS 

FOR  JROVJ  - 1 TO  N-1 
ENABLE  JROW 
FOR  JSUB  - jnOV7+l  TO  N 

TEMPCJSUB)  - M(JSUB,») 

END  FOR 
ENABLE  1 TO  K 

C * CALCULATE  JROW+l-ST  COMPONENT  OF  S 

FOR  JSUB  -JROW+1  TO  N 

S(JSUB,«)  - S(JSUB,*)  - CONJG(  TEMP(  JSUB) ) • 
END  FOR 
END  FOR 

C MULTIPLY  BY  RECIPROCAL  DIAGONALS 
FOR  JROW  - 1 TO  N 

S(JROW,*)  - RECIP(JROW)  • S(JROW,*) 

END  FOR 
END 


S(  JROW,«) 


L-37 


"■'I] 

First  Back  Substitution--LL*-- 

1.1JJ 

Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N,*) 

2N 

P 

L 

see  Fig.  4.39b  (rowwise) 

S(K,*) 

2K 

C 

see  Fig.  4.39c  (vectorwise) 

TEMP(K) 

2K 

s 

C 

St 

temporary  containing  JROW-1 
components  of  S 

RECIP(N) 

N 

C 

R 

reciprocal  diagonals 

JROW 

1 

s 

I 

current  component  of  S being 
calculated 

AUG 

1 

s 

I 

current  steering  vector  being 
calculated 
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BEGIN 


C SUBTPJVCT  MULTIPLES  OF  HOWS  OF  M FROM  S 

FOR  JROV;  - 2 TO  N 
ENABLE  JROW-1 
FOR  AUG  - 1 TO  K 

C « • MULTIPLY  BY  RECIPROCAL  DIAGONAL 

SCAUG,*)  - S(AUG,»)  • RECIP(  JROV7-1) 

TEMP(AUG>  - S(AUG,*) 

END  FOR 

ENABLE  JROW  TO  N 

C • CALCULATE  JROV7-TH  COMPONENTS  OF  S 

FOR  AUG  - 1 TO  K 

S(AUG,*)  - S(AUG,»)  - TEMP(AUG)  • CON JG(  M(  JROW-1 ,•) ) 
END  FOR 
END  FOR 

C MULTIPLY  LAST  COMPONENT  BY  LAST  RECIPROCAL  DIAGONAL 

ENABLE  N 

FOR  AUG  - 1 TO  K 

SCAUG,*)  - S(AUG,«)  • RECIP(N) 

END  FOR 
END 
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First  Back  Substitiition--LL*-- 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39b  (rowwise) 

S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise) 

TEMP(N) 

2N 

S 

C 

temporary  containing  the  JROWth 
column  of  L* 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  T being 
calculated 

JSUB 

1 

S 

I 

component  a multiple  of  which 
is  being  added  to  JROWth 
component 
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n o 


BEGIN 


SUDTPACT  MULTIPLES  OF  EACH  EOV7  FP.OM  PP.ECEDING  HOWS 
CALCULATE  FIRST  COMPONENT  OF  S 

ENABLE  1 TO  K 
S(  1,*)  - S<  1,*)  • RECIP(  1) 

FOR  JROW  « 2 TO  N 
ENABLE  JROV7 
FOR  JSUB  - 1 TO  JROV;-l 

TEMP(JSUB)  - M(JSUB,») 

END  FOR 
ENABLE  1 TO  K 

C * CALCULATE  JROW-TH  COMPONENT  OF  S 

FOR  JSUB  - 1 TO  JROW-1 

S(JROW,*)  - S(JROW,»)  - COKJG(  TEMP(  JSUB) ) • 
END  FOR 

C • MULTIPLY  BY  RECIPROCAL  DIAGONAL 

S(JROV7,*)  - SCJROW,*)  » RECIP(JROW) 

END  FOR 
END 


L-41 


S{ JSUB,*) 


First  Back  Substitution--LL*-- 


[h. 

LH] 


This  implementation  first  transposes  M and  then  uses  First  Back 
Substi tution--LL*-- 


First  Back  Substitution--LL*--  f^y  [T^J 
Number  of  PE's  required  = N 


Variable  Name 

Length 

Location 

Type 

Descri ption 

M{N,*) 

2N 

P 

C 

see  Fig.  4.39b  (columnwise) 

S{N.*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise) 

TEMP{N) 

2N 

S 

C 

temporary  containing  JROWth 
row  of  L* 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  T 

JSUB 

1 

S 

I 

component  from  which  a multiple 
of  the  JROWth  component  is 
subtracted 
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o n 


BEGIN 


SUJirn.ACT  HULTIPLnS  OF  EACH  F.OV;  FROM  FOLLOWING  ROWS 
CATXULATE  FIRST  COMPONENT  OF  S 

ENABLE  1 TO  K 
S(  1,’*)  - S(  1,*)  * RECIP(  1) 

FOR  JROV7  - 1 TO  N-1 
ENABLE  JROV7 
FOR  JSUB  - JROW+1  TO  N 

TEMP(JSUB)  * M(JSUB,») 

END  FOR 
ENABLE  1 TO  K 

C • CALCULATE  JROW+l-ST  COMPONENT  OP  S 

FOR  JSUB  - JROW+1  TO  N 

S(JSUB,*)  - S(JSUB,»)  - CONJGf TEMPC JSUB) ) • 
END  FOR 

C * MULTIPLY  BY  RECIPROCAL  DIAGONA:^ 

S(JROW  + l,*)  - S{JROW+l,*)  • RECIPl  JROW+1) 

END  FOR 
END 


L-44 


S< JROW,*) 


i 


Second  Back  Substitution--LDL*-6E-- 

[m 


This  implementation  transposes  M first  and  then  uses  Second  Back 
Substi  tution--LDL*-GE--  . 

[E] 


L-45 


Second  Back  Substi  tution--LDL*-GE--  ij 
Numoer  of  PE's  required  = N+K  [N] 


Variable  Name  Length  Location  Type  Description 


M(N,*) 

2N 

P 

c 

see  Fig.  4.39e  (rowwise) 

[see  Fig.  4.39b  (rowwise)] 

[S(N,*) 

2N 

P 

c 

see  Fig.  4.39c  (componentwise)] 

TEMP(N) 

2N 

S 

c 

temporary  containing  JROWth 
column  of  M 

JROW 

1 

s 

I 

last  calculated  component  of  S 

JSUB 

1 

s 

1 

component  of  S from  which  a 
multiple  of  the  JROWth  component 
is  subtracted 
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BEGIN 


C SUBTHACT  MULTIPLES  OF  EACH  COMPOI^EKT  FHOM  PnECEDIKG  COMPONENTS 

FOP.  jpov;  - N TO  2 BY  -1 
ENABLE  jpov; 

FOP  JSUB  - 1 TO  JP.OV;-! 

TEMP(JSUn)  - MCJSUB,*) 

END  FOR 

ENABLE  N+1  TO  N+K  C 1 TO  K I 

C • SUBTRACT  MULTIPLE  OP  JROV;-TH  COMPONENT  FROM  PRECEDING  COMPONENTS 

FOR  JSUB  - 1 TO  JROW-1 

. . HCJSUB,*)  - M(JSUB,*)  - TEMP(JSUB)  * MCJROW,*) 

*.  . C S(JSUB,*)  - StJSUB,*)  - TEHP(JSUB)  • S(JROW,»)  ] 

END  FOR 
END  FOR 
END 


L-47 


Variable 

M{N.*) 

S(K,*) 

TEMP(N) 

JROW 

AUG 


Second  Back  Substi tution--LDL*-GE-- 
Number  of  PE's  required  = N 


[J.D 


Name  Length  Location  Type 
2N  P 

2K  P 

2N  S C 

1 S I 

1 S 1 


Descri ption 

see  Fig.  4.39b  (columnwise) 

see  Fig.  4.39c  (vectorwise) 

temporary  containing  JROWth 
components  of  S 

current  row  of  M a multiple 
of  which  is  subtracted  from  S 

current  steering  vector 
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BEGIN 


C SUBTRACT  FROM  THE  COMPONENTS  OF  S MULTIPLES  OF  THE  ROWS  OF  M 

FOR  JROV;  - N TO  2 BY  -1 
ENABLE  JROV; 

FOR  AUG  - 1 TO  K 

TEMP(AUG)  - S(AUG,*) 

END  FOR 

ENABLE  1 TO  JP.OV;-! 

L * DO  EACH  STEERING  VECTOR 

FOR  AUG  - 1 TO  K 

S(AUG,»)  - S(AUG,»)  - TEMP(AUG)  » M(JROW,*) 

END  FOR 
END  FOR 
END 


L-^9 


Second  Back  Substi tution--LDL-GE--| T (11111 
Number  of  PE's  required  = N+K  [N] 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39e  (columnwise) 

[see  Fig.  4.39b  (columnwise)] 

[S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise)] 

TEMP(N) 

2N 

S 

C 

temporary  containing  JROWth 
row  of  L 

JROW 

1 

s 

1 

current  component  of  S 
being  calculated 

JSUB 

1 

S 

1 

component  of  S a multiple  of 
which  is  being  subtracted 
from  the  JROWth  component 

L-r.O 


EEGIH 


C SUSTRACT  FROM  EACH  COIlPO"ENT  MULTIPLES  OF  FOLLOVJirG  COMPONENTS 

FOR  JROV;  - M-1  TO  1 BV  -1 
ENABLE  JROV; 

FOR  JEUB  - Jn017+1  TO  N 

TEIIP(JSUB)  - M(JSUB,«) 

END  FOR 

ENABLE  H+1  TO  IJ+K  C 1 TO  K ] 

C • CALCULATE  JROt7-TH  COMPONENT 

FOR  JSUB  - JROVJ+1  TO  N 

M(JROW,*)  - M(JROW,«)  - TEMP(JSUB)  • M(JSUB,*) 

. C S(JROV/,  •)  - S(JROV/,  ) - TEMP(JSU3)  * S(JSUB,»)  3 

END  FOR 
END  FOR 
END 


L-51 


) 


j 


L-52 


Second  Back  Substitution— LL*--  Q^D 

Number  of  PE's  required  = N+K  [N] 


Variable  Name 

Length 

Location 

Type 

Description 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39e  (rowwise) 

[see  Fig.  4.39b  (rowwise)] 

[S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise)] 

TEMP(N) 

2N 

S 

C 

temporary  containing  JROWth 
column  of  M 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

last  calculated  component  of  S 

JSUB 

1 

S 

I 

component  of  S from  which  a 
multiple  of  the  JROWth 
component  is  subtracted 
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n n 


BEGIN 


SEDTEACT  MULTIPLES  OF  EAtl!  COIlPOl-EITT  FHOIl  PPECEOING  COMPONENTS 
MULTIPLY  LAST  COMPONENT  BY  LAST  RECIPROCAL  DIAGONAL 

ENABLE  tI+1  TO  N+K  C 1 TO  K J 

M(N,*)  ” 'NN,*)  • RECIP(N)  [ S(n,«)  " Gdl,*)  • RECIP(H)  ] 

FOR  jnOV/  - N TO  2 BY  -1 
ENABLE  JROW 
FOR  JSUB  - ■■  TO  JROW-1 

TEHP(JSUB)  - M(JSUB,») 

END  FOR 

ENABLE  N+1  TO  N+K  C 1 TO  K J 
C • SUBTRACT  MULTIPLES  OF  JROW-TH  COMPONENT  FROM  PRECEDING  COMPONENTS 

FOR  JSUB  - 1 TO  JROVJ-1 

M(JSUB,*)  - M(JSUB,*)  - TEMP(JSUB)  • M(JROW,*) 

. C S(JSUB,«)  - S(JSUB,»)  - TEHPCJSUB)  * S(JROW,*)  ] 

END  FOR 

C • MULTIPLY  BY  RECIPROCAL  DIAGONAL 

M(  JROW-1,*)  - M(  JROW-1,*)  * RECIP(  JROW-1) 

*.  C S(  JROW-1,*)  “ S(  JROW-1,*)  * RECIP( JROW-1)  1 

END  FOR 
END 


Second 

Back  Substitution-- 

U_i^ 

[EE 

Number  of  PE's 

required 

= N 

Variable  Name 

Length 

Location 

Type 

Descri ption 

M(N,*) 

2N 

P 

C 

see  Fig.  4.39b  (columnwise) 

S(K,*) 

2K 

P 

C 

see  Fig.  4.39c  (vectorwise) 

TEMP{N) 

2N 

S 

C 

temporary  containing  JROWth 
comporents  of  S 

R(*) 

1 

P 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  row  of  M a multiple 
of  which  is  subtracted  from  S 

AUG 

1 

S 

I 

current  steering  vector 
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BEGIN 


C SUBTRACT  FIICM  THE  COUPON  :MTS  OF  S MULTIPLES  OF  ROWS  OF  M 

FOR  CnuH-J  ^ K TO  2 BY 
EKADLE  JROV? 

FOR  AUG  - 1 TO  K 

S(AUG,*)  ” E(AUG,*)  • R(*) 

TEMP(AUG)  ” 3(AUy,*) 

END  FOR 

ENABLE  1 TO  JROW  } 

C * DO  EACH  STEERING  VEJE'OR 

FOR  AUG  - 1 TO  K 

S(AUG,*)  - S<A..7G,»'  - TEMP(AUG)  * M(JROW,*) 

END  FOR 
END  FOR 

C MULTIPLY  FIRST  COMPONENT  BY  FIRST  RECIPROCAL  DIAGONAL 

ENABLE  1 

FOR  AUG  - 1 TO  K 

S(  AUG,»)  - S(  AUG,  • > ■ Rf 

END  FOR 
END 


L-56 


Second  Back  Substitution--LL*-- 


Number  of  PE's  required  = N+K  [Nj 


Variable  Name 

Length 

Location 

Type 

Description 

2N 

P 

C 

see  Fig.  4. 39e( columnwise) 

[see  Fig.  4.39b  (columnwise)] 

[S(N,*) 

2N 

P 

C 

see  Fig.  4.39c  (componentwise)] 

TEMP(N) 

2N 

S 

C 

temporary  containing  JROWth 
row  of  L 

RECIP(N) 

N 

S 

R 

reciprocal  diagonals 

JROW 

1 

S 

I 

current  component  of  S 
being  calculated 

JSUB 

1 

S 

I 

component  of  S a multiple  of 
which  is  being  subtracted 
from  the  JROWth  components 
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o n 


BEGTH 


SUBTRACT  FROM  EACH  COtlPOITENT  IIULTIPI.ES  OF  FOBBOWirG  COMPOrCNTS 
nULTIPLY  LAST  COtlPOKElIT  EV  LAST  PECIPP.OCAL  DLACCCLAL 

EFABLE  TO  IJ+K  C 1 TO  K 3 

r.dJ,*)  - IKN,*)  * PECIPCN)  C S(N,»)  - SCN,*)  * RECIP(N)  ] 

FOP  JPOW  - N-1  TO  1 BY  -1 
EMABLE  JP.OVJ 
FOR  JSUB  - jnOV/+l  TO  N 

TEMP(JSUB)  “ H(JSUB,») 

END  FOR 

ENABLE  N+1  TO  N+K  C 1 TO  K 3 
C * CALCULATE  JPOW-TH  COMPONENT 

FOR  JSUB  - JROW+1  TO  N 

M(JROV7,»)  - M(JROV7,*)  - TEtlP(JSUB)  * M(  JSUB , * ) 

*.  . [ SIJROV/,*)  - SCJPOW,*)  - Tr,MP(JSUB)  • SCJSUB,*)  3 

END  FOR 

C * MULTIPLY  BY  RECIPROCAL  DIAGONAL 

MCJP.OW,*)  “ H(JROW,*)  * r.EClP(JROW) 

».  C SCJROVJ,*)  - * RECIP(JROW)  3 

END  FOR 
END 
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